Abstract. The paper gives a Banach space-valued extension of the Tb theorem
of Nazarov, Treil and Volberg (2003) concerning the boundedness of singular
integral operators with respect to a measure $\mu$, which only satisfies an
upper control on the size of balls. Under the same assumptions as in their
result, such operators are shown to be bounded on the Bochner spaces
$L^p(\mu;X)$ of functions with values in $X$, a Banach space with the
unconditionality property of martingale differences (UMD). The new proof deals
directly with all $p\in(1,\infty)$ and relies on delicate estimates for the
non-homogeneous "Haar" functions, as well as McConnell's (1989) decoupling
inequality for tangent martingale differences.
1. Introduction

The aim of this paper is to bring together two so-far distinct lines along which
the classical Calderón-Zygmund theory has been generalized: one of them related
to the domain, the other to the range of the functions under consideration. On the
one hand, there has been considerable interest in singular integrals with respect to
quite general measures (in particular, ones failing the doubling hypothesis), and
a fairly complete theory is now available especially due to the efforts of Nazarov,
Treil and Volberg [H HSj, and Tolsa [511 HIl [Mj.

In another direction, where pioneering contributions were made by Bourgain [1, 2]
and Burkholder [3], much of the classical theory of singular integrals has been
extended to the setting of functions which take their values in an infinite-dimensional
Banach space. By the end of the 1980s, this theory had already advanced up
to the vector-valued T1 theorem proved by Figiel [8]. A more recent twist to
this second line, boosted by the work of Weis [29], is the further generalization to
operator-valued integral kernels, although still in the homogeneous (and in most
cases, Euclidean-Lebesguean) situation as far as the underlying measure space is
concerned.

It seems natural to ask for a unification: a vector-valued, non-homogeneous
Calderón-Zygmund theory which would be a common generalization of the two
lines of development described above. In fact, the methods of proof in the two
fields are already quite suggestive of such a convergence, the interplay of
probability and analysis being in the centre: ever since the pioneering contributions, the
vector-valued theory has heavily relied on probabilistic tools, especially martingale
differences and their unconditionality (UMD), which is the defining property of the
class of admissible spaces for most results. Also in the non-homogeneous Tb theorem
[25], martingale differences were employed to construct the basic decomposition
of the operator, and Nazarov, Treil and Volberg have added further probabilistic
ingredients which are decisive for their analysis.

I now recall the hypotheses of the Tb theorem of Nazarov et al. concerning the
underlying measure space and the associated Calderón-Zygmund operators; this
will also be the basic set-up of the present paper. Let $\mu$ be a Borel measure on
$\mathbb{R}^N$ which satisfies, for a real number $d\in(0,N]$, the upper bound

\[
  \mu(B(x,r))\le r^d
\]

for any ball $B(x,r)$ of centre $x\in\mathbb{R}^N$ and radius $r>0$. A $d$-dimensional
Calderón-Zygmund kernel is a function $K(x,y)$ of variables $x,y\in\mathbb{R}^N$, $x\ne y$,
which satisfies

\[
  |K(x,y)|\le\frac{1}{|x-y|^d}, \tag{1.1}
\]

\[
  |K(x,y)-K(x',y)|+|K(y,x)-K(y,x')|\le\frac{|x-x'|^\alpha}{|x-y|^{d+\alpha}} \tag{1.2}
\]

for some $\alpha>0$ and all variables such that $|x-y|>2|x-x'|$. Of course one could
allow multiplicative constants in these assumptions (and some which follow), but
since there will be quite many parameters involved in any case and the full generality
is reached by trivial scaling arguments, I will restrict myself to the normalized
situation above.
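As a quick illustration of the kernel conditions, one may look at the Cauchy kernel $K(z,w)=1/(z-w)$ in the complex plane ($N=2$), the motivating example behind the Cauchy integrals discussed at the end of this introduction. The sketch below assumes that the regularity condition (1.2) has the standard Hölder form with right-hand side $|x-x'|^\alpha/|x-y|^{d+\alpha}$, and checks it with $d=\alpha=1$ up to a multiplicative constant; the constant 4 is not optimized.

```latex
% Sketch: Cauchy kernel K(z,w) = 1/(z-w) on C = R^2, with d = alpha = 1.
% Size condition (1.1) holds with equality:
\[
  |K(z,w)| = \frac{1}{|z-w|}.
\]
% Regularity: if |z-w| > 2|z-z'|, the triangle inequality gives
% |z'-w| >= |z-w| - |z-z'| > |z-w|/2, hence
\[
  |K(z,w)-K(z',w)| = \frac{|z-z'|}{|z-w|\,|z'-w|}
  \le \frac{2\,|z-z'|}{|z-w|^{2}} .
\]
% Since K(w,z) = -K(z,w), the second term in (1.2) obeys the same bound:
\[
  |K(z,w)-K(z',w)| + |K(w,z)-K(w,z')| \le \frac{4\,|z-z'|}{|z-w|^{2}} .
\]
% Consequently K/4 satisfies the normalized conditions (1.1) and (1.2).
```

This is the kind of trivial scaling meant above: dividing the kernel by a fixed constant brings it into the normalized situation.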

Let $T: f\mapsto Tf$ be a linear operator acting on some functions $f$ (this will be
specified in more detail shortly). It is called a Calderón-Zygmund operator with
kernel $K$ if

\[
  Tf(x)=\int_{\mathbb{R}^N}K(x,y)f(y)\,d\mu(y) \tag{1.3}
\]

for $x$ outside the support of $f$.
An operator $T$ is said to satisfy the rectangular weak boundedness property if for
all rectangles $R$ there holds

\[
  \Big|\int 1_R\cdot T1_R\,d\mu\Big|\le\mu(R); \tag{1.4}
\]

as usual in the related literature, a rectangle here means a set of the form
$R=x_0+\prod_{i=1}^N[-\ell_i/2,\ell_i/2)\subset\mathbb{R}^N$. The special case with
$\ell_i=\ell$ for all $i$ is called a cube, and in this case $\ell(R):=\ell$ designates its
side-length. For a cube $Q$ and $\lambda>0$, $\lambda Q$ is the unique cube with the same
centre and $\lambda$ times the radius of $Q$.

A function $b\in L^\infty(\mu)$ is called weakly accretive if

\[
  \Big|\int_Qb\,d\mu\Big|\ge\delta\,\mu(Q)
\]

for all cubes $Q$ and some fixed $\delta>0$. I fix two weakly accretive functions $b_1$ and
$b_2$, which satisfy the above estimate and in addition $\|b_i\|_\infty\le1$. Below, the weak
boundedness property will be assumed for the composition of operators $M_{b_2}TM_{b_1}$,
where $M_b: f\mapsto b\cdot f$ designates the operator of pointwise multiplication by $b$.

A function $h\in L^1_{\mathrm{loc}}(\mu)$ is said to be in $\mathrm{BMO}^p_\lambda(\mu)$, where
$\lambda,p\in[1,\infty)$, if

\[
  \|h\|_{\mathrm{BMO}^p_\lambda(\mu)}
  :=\sup_Q\Big(\frac{1}{\mu(\lambda Q)}\int_Q|h-\langle h\rangle_Q|^p\,d\mu\Big)^{1/p}<\infty,
\]

where the supremum is over all cubes $Q\subset\mathbb{R}^N$. Here
$\langle h\rangle_Q:=\mu(Q)^{-1}\int_Qh\,d\mu$ is the average of $h$ on $Q$. Let some
$\lambda>1$ be fixed from now on.

Let then $X$ be a Banach space and $L^p(\mu;X)$ designate the Bochner space of
$\mu$-measurable $X$-valued functions with its usual norm. The question of interest in
this paper is the boundedness of $T$ on $L^p(\mu;X)$. For the sake of simplicity, I will
concentrate on the quantitative aspect of this problem: I will assume that $T$ is
in fact defined as a continuous linear operator on the whole space $L^p(\mu;X)$ from
the beginning, but I then derive a bound $C$ for its operator norm according to the
following convention:

Notation 1.5. The letter $C$ will always indicate a finite quantity, which depends at
most on the following set of parameters:

\[
  d,\ N,\ p,\ X,\ \alpha,\ \delta,\ \lambda,
\]

plus a few auxiliary ones which will be explicitly introduced below and eventually
chosen in such a way that they, too, only depend on the above-mentioned list.
The numerical value of $C$ need not be the same from one occurrence to another.
An estimate of the type $F\le CG$ will sometimes be abbreviated to $F\lesssim G$, and
$F\lesssim G\lesssim F$ to $F\eqsim G$.

Various ways of reducing to the a priori bounded situation have been discussed
by Nazarov et al. [25]; here I point out just one more strategy, which is specific to
the present vector-valued context: One starts by considering $T$ on functions taking
values in a finite-dimensional subspace $X_0\subset X$. By choosing a basis of $X_0$ and
considering the action of $T$ componentwise, it easily follows from the boundedness
of $T$ on $L^p(\mu)$ (which is the conclusion of the scalar-valued Tb theorem) that it is
also bounded on $L^p(\mu;X_0)$, but the bounds resulting from such a simple argument
will grow as a function of $\dim X_0$. However, once it is shown that the norm of $T$
on $L^p(\mu;X_0)$ is actually bounded by a constant $C$ independent of $X_0\subset X$, it also
follows that $T$ extends continuously to all of $L^p(\mu;X)$ by the density of functions
with a finite-dimensional range.
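The growth with the dimension in the simple componentwise argument can be made concrete. The following is only a sketch under an extra assumption not made in the text: the basis of $X_0$ is taken to be an Auerbach basis $(e_i)_{i=1}^n$ with biorthogonal functionals $(e_i^*)_{i=1}^n$, which exists in every finite-dimensional normed space. Other bases give the same kind of bound with a different constant.

```latex
% Assumption (for illustration): (e_i, e_i^*)_{i=1}^n is an Auerbach basis of X_0,
% n = dim X_0, i.e. \|e_i\| = \|e_i^*\| = 1 and <e_i^*, e_j> = \delta_{ij}.
\[
  f=\sum_{i=1}^n f_i\,e_i,\qquad f_i:=\langle e_i^*,f\rangle,\qquad
  |f_i(x)|\le\|f(x)\|_{X_0}.
\]
% T acts componentwise, Tf = \sum_i (Tf_i) e_i, so the triangle inequality gives
\[
  \|Tf\|_{L^p(\mu;X_0)}
  \le\sum_{i=1}^n\|Tf_i\|_{L^p(\mu)}
  \le\|T\|_{\mathscr{L}(L^p(\mu))}\sum_{i=1}^n\|f_i\|_{L^p(\mu)}
  \le n\,\|T\|_{\mathscr{L}(L^p(\mu))}\,\|f\|_{L^p(\mu;X_0)}.
\]
```

The factor $n=\dim X_0$ is exactly what a dimension-free bound $C$ has to avoid before the density argument can be applied.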

It is well known that the typical singular integral operators $T$ will not extend
boundedly to $L^p(\mu;X)$ for an arbitrary Banach space $X$. In fact, the classical
Hilbert transform $H$ satisfies $H\in\mathscr{L}(L^p(\mathbb{R};X))$ if (Burkholder [3]) and only if
(Bourgain [1]) $X$ has the UMD property, i.e., there holds

\[
  \Big\|\sum_{k=1}^n\epsilon_kd_k\Big\|_{L^p(\mu;X)}
  \le c\,\Big\|\sum_{k=1}^nd_k\Big\|_{L^p(\mu;X)} \tag{1.6}
\]

whenever $(d_k)_{k=1}^n$ is a martingale difference sequence in $L^p(\mu;X)$, and $\epsilon_k=\pm1$.
This property is known to be independent of the parameter $p\in(1,\infty)$, and also its
validity for dyadic martingales with respect to the Lebesgue measure already implies
the general condition (Maurey [H]). UMD implies reflexivity but not conversely,
although all the "usual" reflexive spaces (such as the reflexive Lebesgue, Sobolev,
and Besov spaces, and also the noncommutative $L^p$ spaces) do have UMD.
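The simplest instance of the UMD inequality is worth keeping in mind. The following sketch (an illustration, not part of the original argument) takes $X=H$ a Hilbert space and $p=2$, where orthogonality of martingale differences gives (1.6) with constant $c=1$; by the independence of $p$ recalled above, Hilbert spaces are therefore UMD.

```latex
% Sketch: X = H a Hilbert space, p = 2, (d_k) a martingale difference sequence
% adapted to a filtration (F_k). For j < k, d_j is F_{k-1}-measurable and
% E[d_k | F_{k-1}] = 0, so the differences are orthogonal in L^2(\mu;H):
\[
  \int\langle d_j,d_k\rangle_H\,d\mu
  =\int\big\langle d_j,\mathbb{E}[d_k\,|\,\mathscr{F}_{k-1}]\big\rangle_H\,d\mu=0 .
\]
% Expanding the squares, the signs \epsilon_k = \pm 1 only enter through
% \epsilon_k^2 = 1:
\[
  \Big\|\sum_{k=1}^n\epsilon_kd_k\Big\|_{L^2(\mu;H)}^2
  =\sum_{k=1}^n\|d_k\|_{L^2(\mu;H)}^2
  =\Big\|\sum_{k=1}^nd_k\Big\|_{L^2(\mu;H)}^2 .
\]
```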
It is now possible to formulate the main result: 

Tb theorem 1. Let $X$ be a UMD space and $1<p<\infty$. Let $T$ be a Calderón-Zygmund
operator for which $M_{b_2}TM_{b_1}$ satisfies the rectangular weak boundedness
property and

\[
  \|Tb_1\|_{\mathrm{BMO}^1_\lambda(\mu)}\le1,\qquad\|T^*b_2\|_{\mathrm{BMO}^1_\lambda(\mu)}\le1. \tag{1.7}
\]

Then $\|T\|_{\mathscr{L}(L^p(\mu;X))}\le C$.

The case $X=\mathbb{C}$ is a version of the celebrated Tb theorem of Nazarov, Treil
and Volberg [25]. Its known proof consists of two methodically distinct and
essentially decoupled main parts, as in the classical Calderón-Zygmund theory. First,
the $L^2$ estimate $\|T\|_{\mathscr{L}(L^2(\mu))}\le C$ is proved by exploiting, of course, the Hilbert
space structure of $L^2(\mu)$. Second (although historically this step preceded the first
one, and was proven in the non-homogeneous context by Nazarov, Treil and Volberg
in [23]), some weak-type $L^1$ estimates are deduced, and here one employs
the kernel conditions (1.1) and (1.2) plus the already established (or historically,
postulated) $L^2$ bound. The inequality $\|T\|_{\mathscr{L}(L^p(\mu))}\le C$ for $p\in(1,2)\cup(2,\infty)$
then follows from the abstract principles of interpolation and duality, so it is in this
sense reached somewhat indirectly. The present contribution, as a byproduct of the
vector-valued extension, also offers a new approach to the scalar-valued result in
$L^p(\mu)$, which is more direct than the one just outlined for $p\ne2$.

Of course, the rectangular weak boundedness property and the BMO conditions
(1.7) are also necessary for Tb theorem 1 since they are necessary in the scalar-valued
case, and one can identify $L^p(\mu)$ as a subspace of $L^p(\mu;X)$ by considering
functions with values in any one-dimensional subspace of $X$. One could also allow
only the more restricted cubic weak boundedness property with parameter $\Lambda>1$:

\[
  \Big|\int 1_Q\cdot T1_Q\,d\mu\Big|\le\mu(\Lambda Q)
\]

for all cubes $Q\subset\mathbb{R}^N$. The vector-valued proof could be extended to this situation,
but the somewhat tedious refinements needed in the argument would be more or
less a repetition of the corresponding steps from [25]. Instead, this extension can
be easily deduced from the work already done in the scalar case:
Tb theorem 2. Assume the conditions of Tb theorem 1 except that the rectangular
weak boundedness property of $M_{b_2}TM_{b_1}$ is replaced by the cubic weak boundedness
property with parameter $\Lambda>1$. Then $\|T\|_{\mathscr{L}(L^p(\mu;X))}\le C$, where $C$ is also allowed
to depend on $\Lambda$.

Proof. By Nazarov, Treil and Volberg's Tb theorem, $\|T\|_{\mathscr{L}(L^2(\mu))}\le C$, hence
$M_{b_2}TM_{b_1}$ satisfies the rectangular weak boundedness property. Thus Tb theorem 1
applies. □

The necessity of the assumptions may also be exploited to derive the following 
immediate but interesting variant: 

Tb theorem 3. Let $T$ be a Calderón-Zygmund operator with $\|T\|_{\mathscr{L}(L^2(\mu))}\le1$.
Let $X$ be a UMD space and $1<p<\infty$. Then $\|T\|_{\mathscr{L}(L^p(\mu;X))}\le C$.

Proof. By the converse part of the Tb (or just T1) theorem of Nazarov, Treil and
Volberg, $T$ satisfies the rectangular weak boundedness property and
$T1,T^*1\in\mathrm{BMO}^1_\lambda(\mu)$. Hence Tb theorem 1 (with $b_1=b_2=1$) applies. □

This allows one, e.g., to use the conditions of the accretive system Tb theorem of
Nazarov, Treil and Volberg [M] (which, by their result, imply the $L^2(\mu)$-boundedness)
for checking the $L^p(\mu;X)$-boundedness of a Calderón-Zygmund operator.

In the spirit of the recent vector-valued results [HI US], Tb theorem 1 also admits
a generalization in the context of operator-valued kernels. Integral transformations
with such kernels arise for instance when solving abstract differential equations
in a Banach space, where much of the motivation for this kind of considerations
originally came from; see Weis [29]. From Weis' work and the subsequent
developments, it has been known for some time that for boundedness results analogous
to the scalar-kernel case to be valid, one needs to impose conditions which are
stronger than the first guess "replace all absolute values by norms." Recall that an
operator family $\mathscr{T}\subset\mathscr{L}(X)$ is called Rademacher-bounded, or $R$-bounded, if there
is a constant $c$ such that for all $n\in\mathbb{Z}_+$, all $\xi_1,\dots,\xi_n\in X$ and
$T_1,\dots,T_n\in\mathscr{T}$,

\[
  \Big\|\sum_{k=1}^n\epsilon_kT_k\xi_k\Big\|_{L^p(\Omega;X)}
  \le c\,\Big\|\sum_{k=1}^n\epsilon_k\xi_k\Big\|_{L^p(\Omega;X)}, \tag{1.8}
\]

where $\epsilon_k$ are the Rademacher functions, as above. Denote the smallest admissible $c$
by $\mathscr{R}(\mathscr{T})$ and recall the fundamental contraction principle ([6], 12.2), which in this
language says that $\mathscr{R}(\Lambda\cdot\mathrm{id}_X)\le2\sup_{\lambda\in\Lambda}|\lambda|$ for
$\Lambda\subset\mathbb{C}$; this is the most important
tool in handling random series as above, which will be present throughout the proofs
of the various Tb theorems here.
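To see why Rademacher-boundedness is a genuinely stronger requirement only outside the Hilbert space setting, the following sketch may help. It assumes, purely for illustration, that $X=H$ is a Hilbert space and that the random sums are measured in $L^2(\Omega;H)$; by Kahane's inequality other exponents only change the constants.

```latex
% Sketch: in a Hilbert space H, the Rademacher functions are orthonormal in
% L^2(\Omega), so for any vectors x_1, ..., x_n in H
\[
  \mathbb{E}\Big\|\sum_{k=1}^n\epsilon_kx_k\Big\|_H^2=\sum_{k=1}^n\|x_k\|_H^2 .
\]
% Applying this with x_k = T_k\xi_k and then with x_k = \xi_k:
\[
  \mathbb{E}\Big\|\sum_{k=1}^n\epsilon_kT_k\xi_k\Big\|_H^2
  =\sum_{k=1}^n\|T_k\xi_k\|_H^2
  \le\sup_{T\in\mathscr{T}}\|T\|^2\sum_{k=1}^n\|\xi_k\|_H^2
  =\sup_{T\in\mathscr{T}}\|T\|^2\;\mathbb{E}\Big\|\sum_{k=1}^n\epsilon_k\xi_k\Big\|_H^2 .
\]
% Hence every uniformly bounded family is R-bounded in H (with second moments,
% the R-bound is at most the uniform bound).
```

In a general Banach space no such identity is available, which is why the $R$-bounds have to be postulated separately.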

The rule of thumb, which has guided the recent progress with operator-valued 
kernels, is to replace the boundedness assumptions for scalar kernels by the cor- 
responding Rademacher-boundedness statements in the operator-valued case. The 
following operator-valued Tb theorem implements this idea in the present situation. 
I give a concise statement here, and refer the reader to Section 13 for a detailed
explanation of the assumptions. 

Tb theorem 4. Let $X$ be a UMD space and $1<p<\infty$. Let $T$ be an $\mathscr{L}(X)$-valued
Rademacher-Calderón-Zygmund operator for which $M_{b_2}TM_{b_1}$ satisfies the
rectangular weak Rademacher boundedness property. Let $Y\subset\mathscr{L}(X)$ and
$Z\subset\mathscr{L}(X^*)$ be subspaces with the UMD property, and

\[
  \|Tb_1\|_{\mathrm{BMO}^1_\lambda(\mu;Y)}\le1,\qquad\|T^*b_2\|_{\mathrm{BMO}^1_\lambda(\mu;Z)}\le1.
\]

Then $\|T\|_{\mathscr{L}(L^p(\mu;X))}\le C$, where $C$ is allowed to depend on $Y$ and $Z$, in addition
to the usual parameters.

For all practical purposes, Tb theorem 4 is a generalization of my operator-valued
Tb theorem for the Lebesgue measure [11], although there are minor technical
points (slightly different notions of para-accretivity and weak boundedness, and the
treatment in [11] of kernels $K$ satisfying just a logarithmic version of the Hölder
continuity in (1.2)) which prevent the above result from strictly covering the earlier
one. On the other hand, even for the Lebesgue measure, Tb theorem 4 improves on
that of [11] in one important respect: the subspaces $Y$ and $Z$ are now only required
to have UMD; the additional condition imposed in [11], that their unit balls be
Rademacher-bounded subsets of $\mathscr{L}(X)$ and $\mathscr{L}(X^*)$, is seen to be superfluous by
the new techniques.

Remark 1.9. Since the circulation of the first preprint of this paper, the manuscript 
has undergone quite a substantial evolution. As some citations to this paper have 
already been made, based on what was written in the earlier versions, it seems 
appropriate to comment a little on these developments. 

In my original formulation of all the Tb theorems above, I also needed to impose
another restriction on the Banach space $X$, in addition to the necessary UMD
condition. This was the so-called RMF property, recently introduced by McIntosh,
Portal and myself [14], which means the boundedness $M_R: L^p(\mu;X)\to L^p(\mu)$ of
the Rademacher maximal function

\[
  M_Rf(x):=\mathscr{R}\big(\{\langle f\rangle_Q: Q\ni x\}\big),
\]

where the $R$-bound is over all dyadic cubes $Q$ containing $x\in\mathbb{R}^N$, and the Banach
space $X$ is identified with the operator space $\mathscr{L}(\mathbb{C};X)$ (or $\mathscr{L}(\mathbb{R};X)$) in a canonical
way for the computation of the $R$-bound. This condition was originally studied
[14] only in the case when $d\mu=dx$, but Kemppainen [17] has shown, in analogy
to Maurey's classical result for UMD [TH], that the RMF property with respect
to the Lebesgue measure already implies it for other measures as well, and it is
also independent of the parameter $p\in(1,\infty)$ appearing in its definition [14, 17].
This notion and the related results played an important role in finding the original
weaker versions of the Tb theorems above.

The RMF property was related to the estimation of the paraproduct parts of
the operator $T$ and could originally be avoided (trivially) if $Tb_1=T^*b_2=0$ and
also (non-trivially but straightforwardly) in the case when the measure $\mu$ satisfies
the doubling condition $\mu(B(x,2r))\le K\mu(B(x,r))$. Later on [13], I also found
a somewhat complicated argument to eliminate the RMF assumption under the
condition that $Tb_1,T^*b_2\in L^\infty(\mu)$, and conjectured that it should be eliminated
altogether. The related ideas, although not exactly along the lines suggested in [13],
eventually led to the resolution of this conjecture in the form of the Tb theorems
as stated above.

I conclude the introduction by commenting briefly on the $L^p$-boundedness of
Cauchy integrals, a fundamental question to measure the advances in the theory
of singular integrals both in the scalar-valued and the vector-valued developments.
The boundedness of the Cauchy integral on the circle, i.e., the Hilbert transform,
is of course a classical theorem of M. Riesz, and the extension of this result to
the UMD-valued setting by Burkholder [3], together with the converse statement
by Bourgain [1], may be considered the beginning of harmonic analysis in UMD
spaces.

It was around the same time that the scalar-valued $L^2$-boundedness problem
of the Cauchy integral on arbitrary Lipschitz graphs was answered positively by
Coifman, McIntosh, and Meyer [3]. A few years later, this could be seen as a special
case of the (homogeneous) Tb theorem due to David, Journé, and Semmes [5].
The corresponding result in UMD spaces became available after Figiel proved his
vector-valued T1 theorem [8], since this bootstraps into Tb by the same trick as in
Tb theorem 3 above.

Finally, a precise geometric characterization of the measures for which the
associated Cauchy integral is bounded was identified in terms of a local curvature
condition on $\mu$ in an accumulation of efforts by several authors [HI] [21] [22] [26]. As
before, this then became a corollary of the more general Tb theorems. The present
results, once again, bring the vector-valued theory of the Cauchy integral to the
same level.



2. Strategy of the proof with historical remarks

Among the large family of existing Tb theorems, there is no question about the
parents of the present one: they are the non-homogeneous Tb theorem of Nazarov,
Treil and Volberg [25], and my operator-valued Tb theorem [11]. A reader familiar
with the proof of either one of them will recognize much of the same general
structure here, but in the details there are also substantial departures from the earlier
approaches. This section gives an outline of the proof with commentary on the
relation of its various parts to the existing arguments.

The proof starts from a "twisted" (or "adapted") martingale difference
decomposition of the operator $T$. Let $\mathscr{D}=\bigcup_{k\in\mathbb{Z}}\mathscr{D}_k$ be a system of dyadic
cubes in $\mathbb{R}^N$: each subcollection $\mathscr{D}_k$ is of the form

\[
  \mathscr{D}_k=\{x_k+2^k(n+[0,1)^N): n\in\mathbb{Z}^N\}
\]

for some $x_k\in\mathbb{R}^N$, and each $Q\in\mathscr{D}_k$ is the exact union of $2^N$ cubes
$Q'\in\mathscr{D}_{k-1}$. Let $\mathbb{E}_k:=\mathbb{E}[\cdot|\sigma(\mathscr{D}_k)]$ be the associated conditional
expectations which, because the $\sigma$-algebra $\sigma(\mathscr{D}_k)$ is atomic, admit the explicit
representation

\[
  \mathbb{E}_kf=\sum_{Q\in\mathscr{D}_k}\frac{1_Q}{\mu(Q)}\int_Qf\,d\mu .
\]

If $\mu(Q)=0$ for some cube, the term corresponding to $Q$ in the above series may be
simply taken to be zero. Note that, following [25], the "geometric" indexing of the
dyadic partitions $\mathscr{D}_k$ is used, where larger $k$ refers to larger cubes; this is different
from the "probabilistic" indexing, where larger $k$ refers to a finer $\sigma$-algebra and
hence smaller generating cubes.

Given a para-accretive function $b$, the $b$-twisted conditional expectations and
their localized versions, for $k\in\mathbb{Z}$ and $Q\in\mathscr{D}_k$, are defined by

\[
  \mathbb{E}_k^bf:=b\,\frac{\mathbb{E}_kf}{\mathbb{E}_kb},\qquad\mathbb{E}_Q^bf:=1_Q\mathbb{E}_k^bf,
\]

and the corresponding twisted martingale differences by

\[
  \mathbb{D}_k^bf:=\mathbb{E}_{k-1}^bf-\mathbb{E}_k^bf,\qquad\mathbb{D}_Q^bf:=1_Q\mathbb{D}_k^bf .
\]

(In [11], the adjoints of these operators are used instead, which does not make any
essential difference.) By martingale convergence, there holds $\mathbb{E}_k^bf\to f$ pointwise
a.e. and in $L^p(\mu)$ as $k\to-\infty$.

For any $m\in\mathbb{Z}$, it then follows that

\[
  f=\sum_{k\le m}\mathbb{D}_k^bf+\mathbb{E}_m^bf
  =\sum_{\substack{Q\in\mathscr{D}\\ \ell(Q)\le2^m}}\mathbb{D}_Q^bf
  +\sum_{\substack{Q\in\mathscr{D}\\ \ell(Q)=2^m}}\mathbb{E}_Q^bf \tag{2.1}
\]

with unconditional convergence in $L^p(\mu;X)$ under the UMD assumption (see
Section 3 for details). So far everything is practically the same as in both [11] and [25],
with only minor technical differences. If $f$ is compactly supported, then for all $m$
large enough, depending only on the diameter of the support, the second sum on
the right contains at most $2^N$ non-zero terms.
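The algebra behind the decomposition (2.1) is a telescoping sum. The sketch below spells it out under two assumptions about the notation: that the twisted expectation is $\mathbb{E}_k^bf=b\,\mathbb{E}_kf/\mathbb{E}_kb$, and that the twisted difference is $\mathbb{D}_k^bf=\mathbb{E}_{k-1}^bf-\mathbb{E}_k^bf$, which is the sign convention forced by the geometric indexing (smaller $k$ means finer cubes).

```latex
% Telescoping over the scales l < k <= m, then letting l -> -infinity
% (martingale convergence E_l^b f -> f):
\[
  \sum_{k=l+1}^{m}\mathbb{D}_k^bf
  =\sum_{k=l+1}^{m}\big(\mathbb{E}_{k-1}^bf-\mathbb{E}_k^bf\big)
  =\mathbb{E}_l^bf-\mathbb{E}_m^bf
  \;\longrightarrow\; f-\mathbb{E}_m^bf\qquad(l\to-\infty).
\]
% E_k^b is a projection: g := E_k f / E_k b is sigma(D_k)-measurable, so
% E_k(b g) = g E_k b, and therefore
\[
  \mathbb{E}_k^b\mathbb{E}_k^bf
  =b\,\frac{\mathbb{E}_k\big(b\,\mathbb{E}_kf/\mathbb{E}_kb\big)}{\mathbb{E}_kb}
  =b\,\frac{(\mathbb{E}_kf/\mathbb{E}_kb)\,\mathbb{E}_kb}{\mathbb{E}_kb}
  =\mathbb{E}_k^bf .
\]
```

The non-trivial content of (2.1) is thus not the identity itself but the unconditionality of the convergence, which is where the UMD property enters.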

As the first departure from [25], but still quite closely following [11], the
projections $\mathbb{D}_Q^b$ and $\mathbb{E}_Q^b$ will be further represented in terms of rank-one operators as

\[
  \mathbb{D}_Q^bf=\sum_{u=1}^{2^N-1}b\,\varphi^b_{Q,u}\langle\varphi^b_{Q,u},f\rangle,\qquad
  \mathbb{E}_Q^bf=b\,\varphi^b_{Q,0}\langle\varphi^b_{Q,0},f\rangle, \tag{2.2}
\]

where quite precise information (established in Section 4) about the "Haar"
functions $\varphi^b_{Q,u}$ will be essential in deriving the required bounds. There is a
qualitative difference between the cancellative functions $\varphi^b_{Q,u}$, $u\ge1$, with
$\int b\,\varphi^b_{Q,u}\,d\mu=0$, and the non-cancellative
$\varphi^b_{Q,0}:=\big(\int_Qb\,d\mu\big)^{-1/2}1_Q$. Recall that the classical
$L^2$-normalized Haar functions $h_Q$ associated to a dyadic cube $Q$ satisfy
$\|h_Q\|_1=\|h_Q\|_\infty^{-1}=|Q|^{1/2}$, and the equalities remain true up to constants
even in the $b$-twisted case [11]. In the present situation, there is no upper control of
the $L^\infty(\mu)$ norm of $\varphi^b_{Q,u}$ in terms of the measure $\mu(Q)$, but this can be
compensated by the smallness of the $L^1(\mu)$ norm, so that the following important
property still holds:

\[
  \|\varphi^b_{Q,u}\|_{L^1(\mu)}\,\|\varphi^b_{Q,u}\|_{L^\infty(\mu)}\lesssim1 .
\]
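For comparison, the classical normalization recalled above can be checked by hand. The following sketch uses the one-dimensional Haar function on an interval $I$ with left and right halves $I_-$ and $I_+$ (these names are chosen here for illustration) and the Lebesgue measure.

```latex
% Classical L^2-normalized Haar function on an interval I = I_- \cup I_+:
\[
  h_I:=|I|^{-1/2}\big(1_{I_-}-1_{I_+}\big),\qquad |h_I|=|I|^{-1/2}1_I .
\]
% Its norms are computed directly from |h_I|:
\[
  \|h_I\|_2^2=|I|^{-1}\,|I|=1,\qquad
  \|h_I\|_1=|I|^{-1/2}\,|I|=|I|^{1/2},\qquad
  \|h_I\|_\infty=|I|^{-1/2},
\]
% so that the product of the L^1 and L^infinity norms is exactly one:
\[
  \|h_I\|_1\,\|h_I\|_\infty=1 .
\]
```

In the non-homogeneous setting only the product bound survives, with $\lesssim$ in place of equality, while the individual norms are no longer tied to $\mu(Q)^{\pm1/2}$.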

To estimate the operator norm $\|T\|_{\mathscr{L}(L^p(\mu;X))}$, a pairing $\langle g,Tf\rangle$ will be
considered, where the compactly supported $f\in L^p(\mu;X)$ and $g\in L^{p'}(\mu;X^*)$ are
expanded by means of (2.1) and (2.2), now taking one of the two para-accretive
functions $b_1$ and $b_2$ from the assumptions of the Tb theorem in place of $b$:

\[
  \langle g,Tf\rangle=\lim_{\epsilon\to0}
  \sum_{\substack{Q\in\mathscr{D},\,R\in\mathscr{D}'\\ \epsilon<\ell(Q),\ell(R)\le2^m}}\ \sum_{u,v}
  \langle g,\psi^{b_2}_{R,v}\rangle\,
  \langle\psi^{b_2}_{R,v}b_2,T(b_1\varphi^{b_1}_{Q,u})\rangle\,
  \langle\varphi^{b_1}_{Q,u},f\rangle, \tag{2.3}
\]

where the summation condition for $u$ is $u=1,\dots,2^N-1$ for $\ell(Q)<2^m$ and
$u=0,1,\dots,2^N-1$ for $\ell(Q)=2^m$, and similarly for $v$ in terms of $R$. For a
fixed $\epsilon>0$, the multiple summation consists of only finitely many non-zero terms,
legitimating all the rearrangements that one may like to make in the course of the
proof.

Following Nazarov, Treil and Volberg [25], the functions $f$ and $g$ are expanded
in terms of "Haar" functions related to two different dyadic systems $\mathscr{D}$ and $\mathscr{D}'$. An
important aspect of the proof, already in [25] and even more decisively here, is the
fact that parts of series (2.3) cannot be directly controlled for a pair of preassigned
dyadic systems, but only on average after taking the expectation over independent
random choices of $\mathscr{D}$ and $\mathscr{D}'$; the underlying probability distribution is explained
in Section [S]. In this respect, it is useful to observe that the a priori continuity
of $T$ ensures the uniform boundedness (involving the operator norm of $T$) of the
truncated series in (2.3), so that the expectations may be moved in and out of the
limit by dominated convergence.

This expansion of $\langle g,Tf\rangle$, based on two independent multiresolution analyses
of the domains of $f$ and $g$ (both of which are equal to $\mathbb{R}^N$), is essentially different
from the one which Figiel introduced for the T1 theorem in [8] and I adapted for
Tb in [11]. In Figiel's approach, a single multiresolution analysis of the product
domain $\mathbb{R}^N\times\mathbb{R}^N$ of $f\otimes g$ was employed, which would mean that the summation
over $Q\in\mathscr{D}$ and $R\in\mathscr{D}'$ comes with the restriction to cubes of the same size,
$\ell(Q)=\ell(R)$, while the summation range of $(u,v)$ is
$\{0,1,\dots,2^N-1\}^2\setminus\{(0,0)\}$ on all the length-scales.

To simplify notation, the summations over the bounded ranges of $u$ and $v$ will
mostly be suppressed, and I write

\[
  \varphi_Q:=\varphi^{b_1}_{Q,u},\qquad\psi_R:=\psi^{b_2}_{R,v},\qquad
  T_{RQ}:=\langle\psi_Rb_2,T(b_1\varphi_Q)\rangle
\]

for short.

As in [25], the analysis of the series in (2.3) will be divided into several cases
depending on the relative size and position of the cubes $Q\in\mathscr{D}$ and $R\in\mathscr{D}'$. By
symmetry, it suffices to consider the half of the series with $\ell(Q)\le\ell(R)$. Modulo the
extraction of appropriate paraproduct operators (defined and treated in Section 9), the
coefficients $T_{RQ}$ exhibit good off-diagonal decay when the cubes $Q$ and $R$ move
apart in the "phase space", where the coordinates are the spatial position and the
size of a cube. Thanks to this decay, it is possible (in Section 7) to separately
treat countably many subseries of (2.3), a typical one consisting of cubes such that
$\mathrm{dist}(Q,R)\eqsim2^j\ell(R)$ and $\ell(Q)=2^{-n}\ell(R)$, and the decay will provide estimates
which allow one to make the final summation over $j,n=0,1,2,\dots$ with absolute
convergence. A further separate treatment is made for cubes of which one contains the
other (deeply) in its interior (Section 8), and yet another for cubes of essentially
the same size and very close or even touching each other (Section 10).

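In each case, the subseries in question (consisting of $R\in\mathscr{D}'$ and $Q$ from some
subcollection $\mathscr{D}(R)\subset\mathscr{D}$, depending on $R$) is first estimated by the following basic
randomization trick:

Lemma 2.4. The following inequality holds:

\[
  \Big|\sum_{R\in\mathscr{D}'}\sum_{Q\in\mathscr{D}(R)}\langle g,\psi_R\rangle\,T_{RQ}\langle\varphi_Q,f\rangle\Big|
  \lesssim\|g\|_{L^{p'}(\mu;X^*)}
  \Big\|\sum_{k\in\mathbb{Z}}\epsilon_k\sum_{R\in\mathscr{D}'_k}\psi_R\sum_{Q\in\mathscr{D}(R)}T_{RQ}\langle\varphi_Q,f\rangle\Big\|_{L^p(\mathbb{P}\otimes\mu;X)} . \tag{2.5}
\]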
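Proof. The construction of the Haar functions gives the identity
$\int b_2\psi_R^2\,d\mu=1$. Using this in the first step, one estimates

\[
\begin{aligned}
  \mathrm{LHS}(2.5)
  &=\Big|\iint\Big\langle\sum_{S\in\mathscr{D}'}\epsilon_S\langle g,\psi_S\rangle b_2\psi_S(x),\
    \sum_{R\in\mathscr{D}'}\epsilon_R\psi_R(x)\sum_{Q\in\mathscr{D}(R)}T_{RQ}\langle\varphi_Q,f\rangle\Big\rangle
    \,d\mathbb{P}(\epsilon)\,d\mu(x)\Big| \\
  &\le\Big\|\sum_{S\in\mathscr{D}'}\epsilon_S\langle g,\psi_S\rangle b_2\psi_S\Big\|_{L^{p'}(\mathbb{P}\otimes\mu;X^*)}
    \times\Big\|\sum_{R\in\mathscr{D}'}\epsilon_R\psi_R\sum_{Q\in\mathscr{D}(R)}T_{RQ}\langle\varphi_Q,f\rangle\Big\|_{L^p(\mathbb{P}\otimes\mu;X)} \\
  &\lesssim\mathrm{RHS}(2.5),
\end{aligned}
\]

where the final estimate for the first factor is an application of the unconditionality
of (2.1) in $L^{p'}(\mu;X^*)$; in the second factor, the basic observation is made that, for
a fixed $x\in\mathbb{R}^N$, the summation over $R\in\mathscr{D}'=\bigcup_{k\in\mathbb{Z}}\mathscr{D}'_k$ only contains one non-zero
$\psi_R(x)$ for each $k$, and hence it does not matter if the random signs are indexed by
cubes or the size of the cubes. (This observation will be repeatedly applied without
further notice.) □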

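The collections $\mathscr{D}(R)$ are always of such a form that $R\in\mathscr{D}'_k$ implies
$\mathscr{D}(R)\subset\mathscr{D}_{k-n}$ for some $n\in\mathbb{N}$, independent of $R$. Also, when $Q\in\mathscr{D}(R)$, the
cube $R$ will be contained in a dyadic ancestor $Q^{(n+a)}$ of $Q$, where $a=a(j)$ grows
linearly in $j$ (recall that we are considering cubes with $\mathrm{dist}(Q,R)\eqsim2^j\ell(R)$), but
for a technical reason with a slope slightly bigger than 1. The quantity in the
$L^p(\mu;X)$ norm to be estimated is hence of the form

\[
  \sum_{k\in\mathbb{Z}}\epsilon_k\sum_{S\in\mathscr{D}_{k+a}}1_S(x)\int K_S(x,y)\,\mathbb{D}^{b_1}_{k-n}f(y)\,d\mu(y)
  =:\sum_{k\in\mathbb{Z}}\epsilon_kT^{(k)}\mathbb{D}^{b_1}_{k-n}f(x),
\]

and it remains to prove that

\[
  \mathbb{E}\Big\|\sum_{k\in\mathbb{Z}}\epsilon_kT^{(k)}\mathbb{D}^{b_1}_{k-n}f\Big\|_{L^p(\mu;X)}
  \lesssim2^{-(n+j)\eta}\,\mathbb{E}\Big\|\sum_{k\in\mathbb{Z}}\epsilon_k\mathbb{D}^{b_1}_{k-n}f\Big\|_{L^p(\mu;X)}, \tag{2.6}
\]

since this is bounded by $2^{-(n+j)\eta}\|f\|_{L^p(\mu;X)}$ due to the unconditionality, and the
exponential factor allows the summation over $n,j\in\mathbb{N}$ to complete the estimate of
the full series (2.3).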



The integral kernels $K_S$ will typically satisfy bounds of the type
$\|K_S\|_\infty\lesssim2^{-(n+j)\eta}\ell(S)^{-d}$, with similar but somewhat more complicated form
when $j\in\{0,1\}$, i.e., when the cubes $Q$ are close to or contained inside $R$. Getting
these estimates requires the fine properties of the "Haar" functions $\varphi_Q$ and $\psi_R$.
Recalling that $\mu(S)\lesssim\ell(S)^d$, it is seen that $2^{(n+j)\eta}T^{(k)}F(x)$ is a weighted
average of $F$ in a neighbourhood of $x$.

In the classical Calderón-Zygmund theory, such averaging operators were usually
controlled by the Hardy-Littlewood maximal operator $M$, and the estimate (2.6)
could be deduced from the Fefferman-Stein square-function estimate for $M$. The
lack of a comparable vector-valued theory of a maximal function has necessitated
the invention of alternative tools to circumvent the maximal function arguments in
the estimation of integral operators.

A powerful substitute was provided by Bourgain's square function estimate [2]
for the translations $\tau_y: h\mapsto h(\cdot+y)$, which can be viewed as the basic building
blocks of integral operators via the formula

\[
  \int K_S(x,y)f(y)\,d\mu(y)
  =\int_{B(0,C)}K_S(x,x+\ell(S)u)\,(\tau_{\ell(S)u}f)(x)\,d\mu(x+\ell(S)u). \tag{2.7}
\]

(Note that this simplifies for the Lebesgue measure $d\mu(y)=dy$, since then
$d\mu(x+\ell(S)u)=\ell(S)^N\,du$, so that the integrations on $B(0,C)$ can be carried out with
respect to a fixed reference measure.) Bourgain showed that

\[
  \Big\|\sum_{j\in\mathbb{Z}}\epsilon_j\tau_{2^jy}f_j\Big\|_{L^p(\Omega\times\mathbb{R}^N;X)}
  \lesssim\log(2+|y|)\,\Big\|\sum_{j\in\mathbb{Z}}\epsilon_jf_j\Big\|_{L^p(\Omega\times\mathbb{R}^N;X)} \tag{2.8}
\]

(where $\mathbb{R}^N$ is equipped with the Lebesgue measure) assuming that the Fourier
transforms of the $f_j$ are restricted by the condition
$\operatorname{supp}\hat f_j\subset B(0,2^{-j})$, a condition
which is naturally satisfied when these functions arise from a Littlewood-Paley-type
decomposition. Figiel [7] gave a variant of this result where it is required instead
that $f_j=\mathbb{D}_jf_j$ and $y\in\mathbb{Z}^N$, which would be closer to the present martingale setting.
(The original formulation in [7] in terms of the Haar functions is slightly different
but the equivalence is immediate.)

All the known Banach space-valued T1 and Tb theorems so far have been based
on one of these two remarkable results: Figiel's T1 [8] and my Tb [11] on the
martingale version, and the T1 theorem of mine and Weis [16] on the Fourier-analytic one.
However, a moment's thought reveals that there is no hope of extending the
translation techniques to the non-homogeneous situation. Since only an upper control
of the measure of balls is assumed, a small translation of just a single function (not
to mention a sequence of functions as above) may result in its support being moved
from a set of negligible measure to one with a large $\mu$-mass, with uncontrollable
effect on the $L^p$ norm.

To overcome this problem, I use a different trick based on a two-sided inequality
for so-called tangent martingale difference sequences due to McConnell [20].
This is a stochastic decoupling estimate, explained in detail in Section [S], which
McConnell originally employed for the construction of Itô-type integrals of
UMD-valued random processes. Thus the trick itself is not new, but it seems not to
have been exploited in the context of Calderón-Zygmund theory before. Although
it still avoids maximal functions, this method is somewhat closer in spirit to the
classical maximal function techniques than the translation inequalities (2.8), which
have been the most refined tools in vector-valued harmonic analysis for the past
twenty years. I expect this trick to find further applications besides the results of
the present paper. Indeed, after originally writing this prophecy, I already
discovered one such application (new even for scalar-valued functions) in the context of
pseudo-localization of singular integral operators [H], and there should be more.

This concludes the historical-strategic overview, and I now turn to the details. 

3. A Carleson embedding theorem 

This section provides a "Carleson-type" embedding theorem, which will play a
role both in establishing the unconditionality of the twisted martingale difference
decomposition (2.1) in the next section, and later on in handling the paraproduct
parts of the operator $T$. The result will be formulated in an abstract filtered space
setting, since the special case of actual interest involving $\mathbb{R}^N$ with its systems of
dyadic cubes would not provide any simplification and could at most distract the
attention from the measure-theoretic core of the arguments.

T. P. HYTÖNEN

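Let $(E,\mathscr{M},\mu)$ be a $\sigma$-finite measure space. Let $\mathscr{F}=(\mathscr{F}_j)_{j\in\mathbb{Z}}$ be a decreasing
sequence of sub-$\sigma$-algebras of $\mathscr{M}$, i.e., $\mathscr{F}_{j+1}\subseteq\mathscr{F}_j$, such that each
$(E,\mathscr{F}_j,\mu)$ is also $\sigma$-finite. The shorthand notation $\mathbb{E}_j:=\mathbb{E}[\cdot|\mathscr{F}_j]$ will be
used for the corresponding conditional expectations. Let $\mathscr{F}_j^+$ consist of the sets
$A\in\mathscr{F}_j$ of finite positive measure.

Given a sequence of functions $\theta_j: E\to X_1$, such that $1_A\theta_j\in L^1(E;X_1)$ for
all $A\in\mathscr{F}_j$ of finite measure, the following Carleson norms were introduced by
McIntosh, Portal, and the author [14] in a special case:

\[
\begin{aligned}
  \|\{\theta_j\}_{j\in\mathbb{Z}}\|_{\mathrm{Car}^p(\mathscr{F};X_1)}
  &:=\sup_{k\in\mathbb{Z}}\Big\|\Big(\mathbb{E}_k\Big\|\sum_{j\le k}\epsilon_j\theta_j\Big\|^p_{L^p(\Omega;X_1)}\Big)^{1/p}\Big\|_{L^\infty(E)} \\
  &\phantom{:}=\sup_{k\in\mathbb{Z}}\sup_{A\in\mathscr{F}_k^+}\mu(A)^{-1/p}\Big\|1_A\sum_{j\le k}\epsilon_j\theta_j\Big\|_{L^p(\Omega\times E;X_1)} .
\end{aligned}
\]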



Proposition 3.1. If $\theta_j=\mathbb{E}_j\theta_j$ for all $j\in\mathbb{Z}$, then the Carleson norms
$\|\{\theta_j\}_{j\in\mathbb{Z}}\|_{\mathrm{Car}^p(\mathscr{F};X_1)}$ are equivalent for all $p\in[1,\infty)$.

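Proof. This could be proven in a similar way as the well-known equivalence of the
different (martingale) $\mathrm{BMO}^p$ norms. Instead, I will show how to reduce the claim
to the mentioned result. By approximation, it suffices to treat finitely non-zero
sequences $\theta_j$ in order to avoid problems of convergence in the following expressions.
Consider the filtration $\tilde{\mathscr{F}} = (\tilde{\mathscr{F}}_k)_{k\in\mathbb{Z}}$ on $E \times \Omega$, defined by $\tilde{\mathscr{F}}_k := \sigma(\mathscr{E}_k, \mathscr{F}_k)$,
where $\mathscr{E}_k := \sigma(\varepsilon_j; j \ge k)$. Let $\tilde{E}_k := E[\,\cdot\,|\tilde{\mathscr{F}}_k]$. The space $\mathrm{BMO}^p(\tilde{\mathscr{F}};X_1)$ consists of
all $\theta : E \times \Omega \to X_1$, integrable over sets of finite measure, such that

\[
\|\theta\|_{\mathrm{BMO}^p}
:= \sup_{k\in\mathbb{Z}} \big\| \big( \tilde{E}_k |\theta - \tilde{E}_{k+1}\theta|^p_{X_1} \big)^{1/p} \big\|_\infty
= \sup_{k\in\mathbb{Z}} \sup_{A\in\tilde{\mathscr{F}}_k^+} \tilde{\mu}(A)^{-1/p} \big\| 1_A[\theta - \tilde{E}_{k+1}\theta] \big\|_p,
\qquad \tilde{\mu} := \mu \times P,
\]

is finite.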

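Now consider the particular function $\theta = \sum_{j\in\mathbb{Z}} \varepsilon_j\theta_j$. Then

\[
\tilde{E}_{k+1}\theta = \sum_{j>k} \varepsilon_j\theta_j, \qquad
\theta - \tilde{E}_{k+1}\theta = \sum_{j\le k} \varepsilon_j\theta_j,
\]

and, by the tower rule for conditional expectations with respect to $\tilde{\mathscr{F}}_k \subseteq \sigma(\mathscr{E}_k, \mathscr{M})$,

\[
\tilde{E}_k |\theta - \tilde{E}_{k+1}\theta|^p_{X_1}
= \tilde{E}_k\, E\Big[ \Big| \sum_{j\le k} \varepsilon_j\theta_j \Big|^p_{X_1} \,\Big|\, \sigma(\mathscr{E}_k, \mathscr{M}) \Big].
\]

The conditional expectation inside is computed by keeping the variables $\varepsilon_k$ and
$x \in E$ fixed and taking the average over all $\varepsilon_j$ for $j < k$. Writing $\varepsilon'_j$ for another
set of independent random signs and $E'$ for the corresponding expectation, this
quantity can be written as

\[
E' \Big| \sum_{j<k} \varepsilon'_j\theta_j + \varepsilon_k\theta_k \Big|^p_{X_1}
= E' \Big| \sum_{j\le k} \varepsilon'_j\theta_j \Big|^p_{X_1},
\]

where the equality follows from the observation that the first expectation is actually
independent of the sign $\varepsilon_k$. Hence

\[
\tilde{E}_k |\theta - \tilde{E}_{k+1}\theta|^p_{X_1}
= E_k \Big\| \sum_{j\le k} \varepsilon_j\theta_j \Big\|^p_{L^p(\Omega;X_1)}.
\]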



Applying $\sup_{k\in\mathbb{Z}} \|(\cdot)^{1/p}\|_{L^\infty(E)}$ to the left side above, one gets the martingale $\mathrm{BMO}^p$
norm of $\theta$, while the same functional of the right side yields $\|\{\theta_j\}_{j\in\mathbb{Z}}\|_{\mathrm{Car}^p(\mathscr{F};X_1)}$.
The equivalence of the $\mathrm{Car}^p$ norms thus follows from the equivalence of the
(martingale) $\mathrm{BMO}^p$ norms, which is the well-known John-Nirenberg inequality. □

Remark 3.2. With a one-point measure space $E = \{e\}$ and $\theta_j = \xi_j \in X_1$, it follows
that $\|\{\theta_j\}_{j\in\mathbb{Z}}\|_{\mathrm{Car}^p} = \|\sum_j \varepsilon_j\xi_j\|_{L^p(\Omega;X_1)}$. Hence the previous proof shows that
Kahane's inequality (the equivalence of the different norms of such random sums;
[6], Theorem 11.1) is a consequence of the martingale John-Nirenberg inequality.
This is probably known to experts, but I had not encountered this observation before.
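
The equivalence of norms in Remark 3.2 can be observed numerically in a toy case. The following sketch is not part of the argument; it computes the $L^p(\Omega;X_1)$ norms of a finite Rademacher sum exactly by enumerating all sign patterns. The choice $X_1 = \mathbb{R}^2$ with the maximum norm, the sample vectors `xi`, and the names `rademacher_norm` and `max_norm` are assumptions made purely for illustration.

```python
import itertools


def rademacher_norm(vectors, p, norm):
    """Exact L^p(Omega; X) norm of sum_j eps_j * vectors[j].

    The expectation is an average over all 2^n sign patterns.
    """
    dim = len(vectors[0])
    total = 0.0
    count = 0
    for signs in itertools.product((-1, 1), repeat=len(vectors)):
        # Form the random sum coordinate by coordinate.
        s = [sum(e * v[i] for e, v in zip(signs, vectors)) for i in range(dim)]
        total += norm(s) ** p
        count += 1
    return (total / count) ** (1.0 / p)


def max_norm(v):
    """Maximum norm on R^n (an illustrative choice of Banach space norm)."""
    return max(abs(c) for c in v)


# Hypothetical sample vectors xi_j in X_1 = R^2.
xi = [(1.0, 0.5), (-2.0, 1.0), (0.25, 3.0), (1.5, -1.5)]
norms = {p: rademacher_norm(xi, p, max_norm) for p in (1, 2, 4, 8)}
for p, value in norms.items():
    print(f"p = {p}: {value:.4f}")
```

The printed norms increase with $p$ (by Jensen's inequality) while their ratios stay bounded, which is the content of Kahane's inequality in this finite example.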

Suppose that there are three Banach spaces $X_1$, $X_2$, $X_3$ with $X_2 \subseteq \mathscr{L}(X_1, X_3)$;
the point is here that $X_2$ may be required to have some properties which the full
operator space $\mathscr{L}(X_1, X_3)$ would almost never satisfy. Given a sequence $\{\theta_j\}_{j\in\mathbb{Z}} \in
\mathrm{Car}^1(\mathscr{F};X_2)$, the "paraproduct type" operator

\[
Pf := \sum_{j\in\mathbb{Z}} \varepsilon_j\theta_j E_j f, \tag{3.3}
\]

acting on $f \in L^p(E; X_1)$, is of interest.

There are two closely related results which guarantee the boundedness of $P$ from
$L^p(E;X_1)$ to $L^p(\Omega \times E;X_3)$:

Theorem 3.4. Let $X_3$ be a UMD space, and $1 < p < \infty$. Let $\{\theta_j\}_{j\in\mathbb{Z}}$ be a sequence
such that $\theta_j = E_j\theta_j$ for all $j \in \mathbb{Z}$. Then $P$ defined in (3.3) satisfies

\[
\|Pf\|_{L^p(\Omega\times E;X_3)} \lesssim \|\{\theta_j\}_{j\in\mathbb{Z}}\|_{\mathrm{Car}^1(\mathscr{F};X_2)} \|f\|_{L^p(E;X_1)}.
\]

Theorem 3.5. Let $X_1$ be an RMF space, $1 < p < \infty$, and $\eta > 0$. Let the unit-ball
of $X_2$ be a Rademacher-bounded subset of $\mathscr{L}(X_1,X_3)$. Then $P$ defined in (3.3)
satisfies

\[
\|Pf\|_{L^p(\Omega\times E;X_3)} \lesssim \|\{|\theta_j(\cdot)|_{X_2}\}_{j\in\mathbb{Z}}\|_{\mathrm{Car}^{p+\eta}(\mathscr{F})} \|f\|_{L^p(E;X_1)}.
\]

Although it played an important role in earlier versions of this paper, Theorem 3.5
is eventually not needed here, and it is only recorded above for reasons of
comparison. As stated, it is a slight generalization of Theorem 8.2 from my paper
with McIntosh and Portal [13] and may be proven by an adaptation of the same
argument. In an earlier version of this paper, I had tried to push the analogy of
Theorems 3.4 and 3.5 a bit too far by attempting to deduce even the former one
by a variation of the same technique. This argument turned out to be flawed, and
I am now unaware of any method of proof which would give both theorems as
applications of a common general principle.

The proof of Theorem 3.4, which I will give, follows a similar approach as the
corresponding results behind the earlier vector-valued Tb theorems, which goes back
to Bourgain (see Figiel and Wojtaszczyk [9], who attribute a key step of their
argument to him). It relies on interpolation between appropriate $H^1$-$L^1$ and $L^\infty$-BMO
estimates, where the martingale versions of these spaces will be relevant. Recall
that the martingale Hardy space $H^1(\mathscr{F}; X_1)$ consists of the functions $f \in L^1(E; X_1)$
with $Mf := \sup_{k\in\mathbb{Z}} |E_k f(\cdot)|_{X_1} \in L^1(E)$. For the present purposes, however, it will
be convenient to use the characterization of this space, due to Herz [10], as

\[
H^1(\mathscr{F};X_1) \eqsim {}^{r}\!L^1(\mathscr{F};X_1) + AC(\mathscr{F};X_1), \qquad 1 < r \le \infty.
\]

Here the space ${}^{r}\!L^1(\mathscr{F};X_1)$ of $r$-regulated $L^1$-functions consists of all $f \in L^1(E; X_1)$
with a representation $f = \sum_{j=1}^\infty \lambda_j a_j$, where $(\lambda_j)_{j=1}^\infty \in \ell^1$ and each $a_j \in L^r(E; X_1)$
is an atom of $L^r$-type. This means that there exist $k = k(j) \in \mathbb{Z}$ and $A \in \mathscr{F}_k^+$
such that $a_j = 1_A a_j$, $E_k a_j = 0$, and $\mu(A)^{1/r'} \|a_j\|_r \le 1$. The norm in this space is
the infimum of the $\ell^1$ norms of the coefficient sequences over all such representations.
The space $AC(\mathscr{F}; X_1)$ of absolutely convergent $L^1$-martingales consists of
$h = \sum_{k\in\mathbb{Z}} D_k h \in L^1(E;X_1)$ with $\|h\|_{AC} := \sum_{k\in\mathbb{Z}} \|D_k h\|_1 < \infty$. This latter
component of $H^1(\mathscr{F};X_1)$ is required, in addition to the atomic part familiar from the
classical theory, because of the non-doubling nature of the underlying measure.

For convenience, it will be assumed that the sequence $\{\theta_j\}_{j\in\mathbb{Z}}$ is finitely nonzero,
but the bounds will be proven in terms of its Carleson norm only. Then it is 
straightforward to pass to the general case in the final estimates of interest. 
The proof begins with: 

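Lemma 3.6. Let $X_3$ be a UMD space and $r \in (1,\infty]$. Then

\[
\|Pf\|_{L^1(E\times\Omega;X_3)} \lesssim \|\{\theta_j\}_{j\in\mathbb{Z}}\|_{\mathrm{Car}^1(\mathscr{F};X_2)} \|f\|_{{}^{r}\!L^1(\mathscr{F};X_1)}.
\]

Proof. Thanks to the a priori boundedness under the assumption that $\{\theta_j\}_{j\in\mathbb{Z}}$ is
finitely non-zero, it suffices to prove the uniform bound on all atoms $a$. So let
$a = 1_A a$ with $A \in \mathscr{F}_k^+$, $E_k a = 0$ and $\mu(A)^{1/r'} \|a\|_r \le 1$. Let also the Carleson norm
be normalized to be 1. Choose auxiliary exponents $p \in (1,r)$ and $q \in [p, \infty)$ such
that $1/p = 1/q + 1/r$. Now $E_j a = E_j E_k a = 0$ for $j \ge k$, while $\theta_j E_j a = 1_A E_j(\theta_j a)$
for $j < k$, and hence

\[
Pa = 1_A \sum_{j<k} \varepsilon_j E_j(\theta_j a).
\]

This leads to the estimate

\[
\|Pa\|_1 \le \mu(A)^{1/p'} \Big\| \sum_{j<k} \varepsilon_j E_j(\theta_j a) \Big\|_p
\lesssim \mu(A)^{1/p'} \Big\| \sum_{j<k} \varepsilon_j\theta_j a \Big\|_p
\le \mu(A)^{1/p'} \Big\| 1_A \sum_{j<k} \varepsilon_j\theta_j \Big\|_q \|a\|_r
\lesssim \mu(A)^{1/p'} \mu(A)^{1/q} \mu(A)^{-1/r'} = 1,
\]

where the second step was Bourgain's vector-valued Stein inequality [2], and all the
other bounds are elementary. □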

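The $H^1(\mathscr{F};X_1) \to L^1(E \times \Omega;X_3)$ boundedness of $P$ is completed by the
characterization of Herz together with:

Lemma 3.7. For arbitrary Banach spaces,

\[
\|Pf\|_{L^1(E\times\Omega;X_3)} \le \|\{\theta_j\}_{j\in\mathbb{Z}}\|_{\mathrm{Car}^1(\mathscr{F};X_2)} \|f\|_{AC(\mathscr{F};X_1)}.
\]

Proof. Notice that $E_j D_k = D_k$ for $j < k$ and zero otherwise. Hence

\[
P D_k f = \sum_{j<k} \varepsilon_j\theta_j D_k f,
\]

and thus (taking the Carleson norm equal to one again)

\[
\|P D_k f\|_1
\le \iint_{E\times\Omega} \Big| \sum_{j<k} \varepsilon_j\theta_j \Big|_{X_2} |D_k f|_{X_1} \,dP\,d\mu
= \int_E E_{k-1}\Big( \int_\Omega \Big| \sum_{j<k} \varepsilon_j\theta_j \Big|_{X_2} dP \Big) |D_k f|_{X_1} \,d\mu
\le \|D_k f\|_1,
\]

where the second step follows from the fact that $D_k f$ is already $\mathscr{F}_{k-1}$-measurable,
while in the third one it was observed that the quantity in parentheses is uniformly
bounded by $\|\{\theta_j\}_{j\in\mathbb{Z}}\|_{\mathrm{Car}^1}$. Summing over $k \in \mathbb{Z}$ completes the proof. □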



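In the upper end, the space $\mathrm{BMO}(\tilde{\mathscr{F}}; X_3)$ on the product measure space $E \times \Omega$,
as defined in the proof of Proposition 3.1, is needed.

Lemma 3.8. Let $X_3$ be a UMD space. Then

\[
\|Pf\|_{\mathrm{BMO}(\tilde{\mathscr{F}};X_3)} \lesssim \|\{\theta_j\}_{j\in\mathbb{Z}}\|_{\mathrm{Car}^1(\mathscr{F};X_2)} \|f\|_{L^\infty(E;X_1)}.
\]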

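Proof. As in the proof of Proposition 3.1, there holds

\[
(I - \tilde{E}_k)Pf = \sum_{j<k} \varepsilon_j E_j(\theta_j f).
\]

Let $A \in \tilde{\mathscr{F}}_{k-1}$, and notice that the signs $\varepsilon_j$, for $j < k-1$, are independent of $A$
as well as, obviously, of the functions $E_j(\theta_j f)$. Hence they may be replaced by
independent copies $\varepsilon'_j$ on another probability space $\Omega'$ as far as the computation of
norms is concerned. This leads to

\[
\|1_A(I - \tilde{E}_k)Pf\|_{L^p(E\times\Omega;X_3)}
= \Big\| 1_A \Big( \sum_{j<k-1} \varepsilon'_j E_j(\theta_j f) + \varepsilon_{k-1} E_{k-1}(\theta_{k-1} f) \Big) \Big\|_{L^p(E\times\Omega;L^p(\Omega';X_3))}.
\]

The observation that the inner $L^p(\Omega'; X_3)$ norm is actually independent of the value
of $\varepsilon_{k-1} \in \{-1,+1\}$ allows even its replacement by an independent copy, resulting in

\[
\|1_A(I - \tilde{E}_k)Pf\|_{L^p(E\times\Omega;X_3)}
= \Big\| 1_A \sum_{j<k} \varepsilon'_j E_j(\theta_j f) \Big\|_{L^p(E\times\Omega;L^p(\Omega';X_3))}.
\]

By Fubini's theorem, $A(\omega) := \{x \in E : (x,\omega) \in A\} \in \mathscr{F}_{k-1}$ for $P$-a.e. $\omega \in \Omega$, so
that the multiplication operator with $1_{A(\omega)}$ commutes with $E_j$ for $j < k$. Thus

\[
\begin{aligned}
\Big\| 1_A \sum_{j<k} \varepsilon'_j E_j(\theta_j f) \Big\|^p_{L^p(E\times\Omega;L^p(\Omega';X_3))}
&= \int_\Omega \Big\| \sum_{j<k} \varepsilon'_j E_j(1_{A(\omega)}\theta_j f) \Big\|^p_{L^p(E\times\Omega';X_3)} dP(\omega) \\
&\lesssim \int_\Omega \Big\| \sum_{j<k} \varepsilon'_j 1_{A(\omega)}\theta_j f \Big\|^p_{L^p(E\times\Omega';X_3)} dP(\omega) \\
&\le \int_\Omega \Big\| 1_{A(\omega)} \sum_{j<k} \varepsilon'_j\theta_j \Big\|^p_{L^p(E\times\Omega';X_2)} \|f\|^p_\infty \,dP(\omega) \\
&\lesssim \int_\Omega \mu(A(\omega)) \,dP(\omega)\, \|f\|^p_\infty = \tilde{\mu}(A) \|f\|^p_\infty,
\end{aligned}
\]

where the first estimate was Bourgain's vector-valued Stein inequality, the third
one the assumed Carleson condition, while the second is obvious. This completes
the proof. □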

Taken together, the last three lemmas yield: 

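Proof of Theorem 3.4. It has been shown that

\[
P : H^1(\mathscr{F};X_1) \to L^1(E \times \Omega;X_3), \qquad P : L^\infty(E;X_1) \to \mathrm{BMO}(\tilde{\mathscr{F}};X_3);
\]

thus the composition of $P$ with the sharp maximal operator $M^\#$ (more precisely,
its martingale version with respect to $\tilde{\mathscr{F}}$) maps

\[
M^\# P : H^1(\mathscr{F};X_1) \to L^{1,\infty}(E \times \Omega), \qquad M^\# P : L^\infty(E;X_1) \to L^\infty(E \times \Omega),
\]

and hence the boundedness of

\[
M^\# P : L^p(E; X_1) \to L^p(E \times \Omega), \qquad P : L^p(E; X_1) \to L^p(E \times \Omega; X_3)
\]

follows from standard interpolation results. □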



As a matter of fact, the preceding proof of Theorem 3.4 was essentially written
down in my original manuscript of [11] already, but as this generality was not
necessary for the version of the Tb theorem then under consideration, the referee
insisted on leaving it out in favour of a simpler argument valid for doubling measures
only, and hence it did not appear in the published version of [11].

4. Martingale difference decomposition 

In this section I prove the unconditional convergence of the twisted martingale
difference decomposition stated in (2.1) and (2.2) and establish the basic properties
of the "Haar" functions $\varphi_Q$, $\psi_R$ appearing in this decomposition. By standard
considerations involving duality and the density in $L^p(\mu)$ of linear combinations of
indicators of dyadic cubes, it suffices for (2.1) to show the following randomized
unconditionality estimate. In the doubling case, it was proven in [11].

Proposition 4.1.

\[
\Big\| \sum_{k\in\mathbb{Z}} \varepsilon_k D_k^b f \Big\|_{L^p(P\otimes\mu;X)} \lesssim \|f\|_{L^p(\mu;X)}.
\]

Proof. Write out

\[
\frac{1}{b} D_k^b f = \frac{E_{k-1}f}{E_{k-1}b} - \frac{E_k f}{E_k b}
= \frac{E_k b - E_{k-1}b}{E_k b} \cdot \frac{E_{k-1}f}{E_{k-1}b} + \frac{E_{k-1}f - E_k f}{E_k b} \tag{4.2}
\]

and observe that the factors $b$ and $E_k b$ may be discarded by their boundedness from
above and below. The second term on the right above is then simply a martingale
difference of $f$, so their random sum is estimated as a direct application of UMD.

The random sum of the first terms gives the paraproduct $Pf$ from (3.3) with
$\theta_k := D_{k+1}b = E_k\theta_k$, and hence is dominated by $\|f\|_{L^p(\mu;X)}$ times the supremum
over $k \in \mathbb{Z}$ and $A \in \mathscr{F}_k^+$ of

\[
\mu(A)^{-1/q} \Big\| 1_A \sum_{j\le k} \varepsilon_j D_{j+1}b \Big\|_{L^q(P\otimes\mu)}
\le \mu(A)^{-1/q} \Big\{ \Big\| \sum_{j<k} \varepsilon_j D_{j+1}(1_A b) \Big\|_{L^q(P\otimes\mu)} + \|1_A D_{k+1}b\|_{L^q(\mu)} \Big\},
\qquad q \in (1,\infty).
\]

The first term in braces is bounded by $C\|1_A b\|_{L^q(\mu)} \le C\mu(A)^{1/q} \|b\|_\infty$ using the
UMD property of $\mathbb{C}$, while the second one is dominated by $\mu(A)^{1/q}(\|E_k b\|_\infty +
\|E_{k+1}b\|_\infty) \le 2\mu(A)^{1/q}\|b\|_\infty$, since the conditional expectations are contractions in
$L^\infty(E)$. Collecting everything together, the proof is complete. □

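I then pass to the finer decomposition of the martingale differences $D_k^b f$ in terms
of rank-one operators. Generalizing the notation $E_Q^b$ for $Q \in \mathscr{D}$, denote

\[
E_A^b f := 1_A \frac{\int_A f \,d\mu}{\int_A b \,d\mu} \cdot b,
\]

when $A$ is any measurable set with $\int_A b \,d\mu \ne 0$; the special case $b \equiv 1$ will be
abbreviated as $E_A := E_A^1$. If $\mathscr{A}$ is a disjoint collection of such sets, write

\[
E_{\mathscr{A}}^b f := \sum_{A\in\mathscr{A}} E_A^b f.
\]

With this notation one can express

\[
D_Q^b f = E^b_{\{Q'\in\mathscr{D};\, Q'\subset Q,\, \ell(Q')=\ell(Q)/2\}} f - E_Q^b f.
\]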



Lemma 4.3. For each $Q \in \mathscr{D}$, its $2^N$ subcubes $Q_u \in \mathscr{D}$ with $\ell(Q_u) = \ell(Q)/2$ may
be indexed in such a way that

\[
\Big| \int_{\bigcup_{u=k}^{2^N} Q_u} b \,d\mu \Big| \ge [1 - (k-1)2^{-N}]\,\delta\mu(Q) \tag{4.4}
\]

for all $k = 1, \dots, 2^N$.

Proof. The case $k = 1$ is fine for any ordering of the subcubes. Let us assume
that we have an indexing of the cubes $Q_1, \dots, Q_{j-1}$ so that (4.4) holds for all
$k = 1, \dots, j < 2^N$. In particular

\[
[1 - (j-1)2^{-N}]\,\delta\mu(Q)
\le \Big| \sum_{u=j}^{2^N} \int_{Q_u} b \,d\mu \Big|
= \frac{1}{2^N - j} \Big| \sum_{u=j}^{2^N} \sum_{\substack{i=j \\ i\ne u}}^{2^N} \int_{Q_i} b \,d\mu \Big|
\le \frac{2^N - j + 1}{2^N - j} \max_{u\in\{j,\dots,2^N\}} \Big| \int_{\bigcup_{i=j,\, i\ne u}^{2^N} Q_i} b \,d\mu \Big|.
\]

It follows that for at least one $u \in \{j, \dots, 2^N\}$, we have

\[
\Big| \int_{\bigcup_{i=j,\, i\ne u}^{2^N} Q_i} b \,d\mu \Big| \ge [1 - j2^{-N}]\,\delta\mu(Q).
\]

By reordering the remaining cubes, we may assume that $u = j$, and then we have
fixed an indexing of the cubes $Q_1, \dots, Q_j$ so that (4.4) holds for all $k = 1, \dots, j+1$.
Thus the claim follows by induction. □

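Let the indexing of the subcubes $Q_u$ henceforth be the one provided by the
Lemma. Let $\hat{Q}_k := \bigcup_{u=k}^{2^N} Q_u$, so in particular $\hat{Q}_1 = Q$ and $\hat{Q}_{2^N} = Q_{2^N}$, and the
Lemma implies that $\mu(\hat{Q}_k) \gtrsim \mu(Q)$ (while "$\le$" is obvious), since $b$ is bounded.

One obtains the splitting

\[
D_Q^b = \sum_{u=1}^{2^N-1} \big[ E^b_{\{Q_1,\dots,Q_u,\hat{Q}_{u+1}\}} - E^b_{\{Q_1,\dots,Q_{u-1},\hat{Q}_u\}} \big] =: \sum_{u=1}^{2^N-1} D^b_{Q,u}.
\]

Now take a closer look at $D^b_{Q,u}$; the abbreviation $f(A) := \int_A f \,d\mu$ will be used,
with the same convention for $b$ in place of $f$. Assume that $\mu(Q_u) > 0$. Then

\[
D^b_{Q,u} f = b \Big( \frac{1_{Q_u}}{b(Q_u)} f(Q_u) + \frac{1_{\hat{Q}_{u+1}}}{b(\hat{Q}_{u+1})} f(\hat{Q}_{u+1}) - \frac{1_{\hat{Q}_u}}{b(\hat{Q}_u)} \big[ f(Q_u) + f(\hat{Q}_{u+1}) \big] \Big)
= b\,\varphi_{Q,u} \langle \varphi_{Q,u}, f \rangle,
\]

where

\[
\varphi_{Q,u} := \Big( \frac{b(Q_u)\, b(\hat{Q}_{u+1})}{b(\hat{Q}_u)} \Big)^{1/2} \Big( \frac{1_{Q_u}}{b(Q_u)} - \frac{1_{\hat{Q}_{u+1}}}{b(\hat{Q}_{u+1})} \Big);
\]

the choice of the sign of the (in general complex) square root above is irrelevant
and may be made arbitrarily.

If $\mu(Q_u) = 0$, then $D^b_{Q,u} = 0$, and one may define $\varphi_{Q,u} := 0$. The following proposition
collects several basic properties of the functions $\varphi_{Q,u}$ which are straightforward
consequences of the previous considerations.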

Proposition 4.5. The "Haar" functions satisfy 



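and if $\varphi_{Q,u} \ne 0$, then

\[
|\varphi_{Q,u}| \eqsim \mu(Q_u)^{1/2} \Big( \frac{1_{Q_u}}{\mu(Q_u)} + \frac{1_{\hat{Q}_{u+1}}}{\mu(\hat{Q}_{u+1})} \Big).
\]

Hence

\[
\|\varphi_{Q,u}\|_{L^p(\mu)} \eqsim \mu(Q_u)^{1/p - 1/2}, \qquad p \in [1,\infty],
\]

and in particular

\[
\|\varphi_{Q,u}\|_{L^1(\mu)} \|\varphi_{Q,u}\|_{L^\infty(\mu)} \eqsim 1.
\]

5. Random dyadic systems; good and bad cubes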

In this section I give a convenient parameterization of the dyadic systems as
considered above, and use this to introduce a probability distribution on the collection
of all such dyadic systems. The construction is equivalent to that used by Nazarov,
Treil and Volberg ([25], Sec. 9.1), but it will be given in a somewhat different and
hopefully transparent way.

Let $\mathscr{D}^0$ denote the standard dyadic system consisting of all $2^k(n + [0,1[^N)$, where
$k \in \mathbb{Z}$ and $n \in \mathbb{Z}^N$. A general dyadic system has been defined as a collection
$\mathscr{D} = \bigcup_{k\in\mathbb{Z}} \mathscr{D}_k$, where $\mathscr{D}_k = x_k + \mathscr{D}_k^0$ for some $x_k \in \mathbb{R}^N$ and in addition the partition
$\mathscr{D}_k$ refines $\mathscr{D}_{k+1}$.

There is obviously some redundancy in the choice of $x_k$, since only its value
modulo $2^k$ (in each coordinate) is relevant. Thus, without loss of generality, it may
be assumed that $x_k \in [0, 2^k[^N$. On the other hand, the condition that $\mathscr{D}_k$ refine
$\mathscr{D}_{k+1}$ can be rephrased as $x_k \equiv x_{k+1} \bmod 2^k$, or in other words $x_{k+1} = x_k + \beta_k 2^k$
for some $\beta_k \in \{0,1\}^N$. It follows by iteration that

\[
x_k = \sum_{j<k} \beta_j 2^j, \qquad \beta_j \in \{0,1\}^N.
\]

Hence the whole system $\mathscr{D}$ can be thought of as a shift of the standard system,
$\mathscr{D} = \mathscr{D}^0 + \beta$, where $\beta$ is the formal power series $\beta = \sum_{j\in\mathbb{Z}} \beta_j 2^j$, it being understood
that a truncation modulo $2^k$ of this series is first made before computing the shift
$Q + \beta := Q + \sum_{j<k} \beta_j 2^j$ for $Q \in \mathscr{D}_k^0$.

Now that all dyadic systems have been parameterized by $\beta \in (\{0,1\}^N)^{\mathbb{Z}}$, there
is an obvious way to interpret a "random dyadic system" by assigning the natural
product probability on $(\{0,1\}^N)^{\mathbb{Z}}$ so that the coordinate functions $\beta_j$ are independent
and $P(\beta_j = \eta) = 2^{-N}$ for all $\eta \in \{0,1\}^N$. (Actually, since it was required that
$x_k = \sum_{j<k} \beta_j 2^j \in [0, 2^k[^N$, one should exclude the sequences $\beta$ with the following
property: for some $k \in \mathbb{Z}$ and $i \in \{1, \dots, N\}$, the $i$th coordinate of $\beta_j$ equals 1 for
all $j < k$. But this does not affect any of the probabilistic statements, since
sequences of this kind have probability zero.)

Note that the formal shift parameter $\beta = \sum_{j\in\mathbb{Z}} \beta_j 2^j$ cannot in general be
replaced by a real shift by some vector $x \in \mathbb{R}^N$, and in fact the dyadic systems
$x + \mathscr{D}^0$ have vanishing probability among all dyadic systems. One can show that
$\mathscr{D} = \beta + \mathscr{D}^0$ is of the mentioned special form if and only if there is a $k \in \mathbb{Z}$ and
$\eta \in \{0,1\}^N$ such that $\beta_j = \eta$ for all $j > k$, and clearly sequences of this kind have
zero probability among all $\beta \in (\{0,1\}^N)^{\mathbb{Z}}$.
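
The parameterization above can be illustrated by a short computation. The following is only a sketch under simplifying assumptions: the bits $\beta_j$ are sampled for a finite window `J_MIN <= j < J_MAX` and treated as zero below `J_MIN`; the names `random_beta`, `shifts` and `cube_corner`, the window, and the sample point are all hypothetical choices for illustration. Exact rational arithmetic is used so that the nesting check is not affected by rounding.

```python
import math
import random
from fractions import Fraction


def random_beta(n_dim, j_min, j_max, rng):
    """Independent uniform beta_j in {0,1}^N for j_min <= j < j_max."""
    return {j: tuple(rng.randrange(2) for _ in range(n_dim))
            for j in range(j_min, j_max)}


def shifts(beta, n_dim, j_min, j_max):
    """Return {k: x_k} with x_k = sum_{j_min <= j < k} beta_j 2^j."""
    acc = [Fraction(0)] * n_dim
    x = {j_min: tuple(acc)}
    for j in range(j_min, j_max):
        acc = [a + b * Fraction(2) ** j for a, b in zip(acc, beta[j])]
        x[j + 1] = tuple(acc)
    return x


def cube_corner(point, k, x_k):
    """Lower corner of the cube of side 2^k in x_k + D_k^0 containing point."""
    side = Fraction(2) ** k
    return tuple(xc + side * math.floor((pc - xc) / side)
                 for pc, xc in zip(point, x_k))


rng = random.Random(0)
N, J_MIN, J_MAX = 2, -4, 5
beta = random_beta(N, J_MIN, J_MAX, rng)
x = shifts(beta, N, J_MIN, J_MAX)
point = (Fraction(7, 3), Fraction(-11, 5))

nested = []
for k in range(J_MIN, J_MAX):
    small = cube_corner(point, k, x[k])
    big = cube_corner(point, k + 1, x[k + 1])
    # The level-k cube containing the point must lie inside the level-(k+1) cube.
    nested.append(all(b <= s and s + Fraction(2) ** k <= b + Fraction(2) ** (k + 1)
                      for s, b in zip(small, big)))
print(all(nested))
```

The nesting holds for every choice of bits because $x_{k+1} - x_k = \beta_k 2^k$ is a multiple of $2^k$ in each coordinate, which is exactly the refinement condition stated above.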

I next recall the notion of singular cubes, essentially following [25], Def. 7.2. This
involves two auxiliary parameters

\[
\gamma := \frac{\alpha}{2(\alpha + d)}
\]

and (a large) $r \in \mathbb{Z}_+$, which will be chosen later. It is required that, at least,

2'-(i-7) > (5,1) 

where A is the parameter of the BMO^ spaces in the assumptions. 
A pair of cubes $\{Q, R\}$ with $\ell(Q) \le \ell(R)$ is called singular if

\[
\operatorname{dist}(Q, \partial S) \le \ell(Q)^\gamma \ell(R)^{1-\gamma}
\]

when $S = R$ or when $S$ is any one of the $2^N$ dyadic subcubes of $R$ with $\ell(S) = \frac12\ell(R)$.
The pair $\{Q, R\}$ is called essentially singular if, in addition, $\ell(Q) \le 2^{-r}\ell(R)$.

Given two dyadic systems $\mathscr{D}$ and $\mathscr{D}'$, a cube $Q \in \mathscr{D}$ is called bad with respect to
$\mathscr{D}'$ if there exists an $R \in \mathscr{D}'$ with $\ell(R) \ge \ell(Q)$ such that the pair $\{Q, R\}$ is essentially
singular, and otherwise it is called good. Just like in [25], I will consider martingale
difference decompositions with respect to two independent random dyadic systems
$\mathscr{D} = \mathscr{D}^\beta$ and $\mathscr{D}' = \mathscr{D}^{\beta'}$, where $\beta, \beta' \in (\{0,1\}^N)^{\mathbb{Z}}$. There will be a separate
treatment for the good and bad cubes in these systems but, deviating slightly from
[25], the relevant badness of $Q \in \mathscr{D}$ will be defined not directly with respect to $\mathscr{D}'$
but with respect to a modification thereof.

To this end, let us consider two more random binary sequences $\tilde\beta$ and $\tilde\beta'$. Then I
declare that $Q \in \mathscr{D}$ is bad if it is bad with respect to the dyadic system $\mathscr{D}'(Q)$
parameterized by the binary sequence $(\tilde\beta'_j)_{2^j<\ell(Q)} \cup (\beta'_j)_{2^j\ge\ell(Q)}$, and similarly $R \in \mathscr{D}'$ is bad
if it is bad with respect to the system parameterized by $(\tilde\beta_j)_{2^j<\ell(R)} \cup (\beta_j)_{2^j\ge\ell(R)}$.
The notation $\mathscr{D}_{\mathrm{bad}}$, $\mathscr{D}_{\mathrm{good}}$ will be used for the bad and good subcollections of $\mathscr{D}$,
and similarly for $\mathscr{D}'$.

The advantage of this definition is that the badness or goodness of any $Q \in \mathscr{D}$ is
independent of the positions of the relatively smaller cubes $R \in \mathscr{D}'$, $\ell(R) \le \ell(Q)$,
which depend on $(\beta'_j)_{2^j<\ell(R)}$ only. This has the following useful consequence:

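Lemma 5.2. Let $\phi(Q, R)$ be a function depending on two cubes $Q$ and $R$. With $\beta'$
and $\tilde\beta'$ fixed, under a random choice of $\beta$ and $\tilde\beta$, the expectations of the series

\[
\sum_{R\in\mathscr{D}'_{\mathrm{good}}} \sum_{\substack{Q\in\mathscr{D} \\ \ell(Q)\le\ell(R)}} \phi(Q,R), \qquad
\sum_{R\in\mathscr{D}'} \sum_{\substack{Q\in\mathscr{D} \\ \ell(Q)\le\ell(R)}} \phi(Q,R)
\]

differ only by a multiplicative factor depending on the parameters $r$ and $\gamma$.

Thus, under the expectations, we may take the bigger cube in such summations
to be restricted to the good cubes or not, as we wish. Since $\beta'$ and $\tilde\beta'$, which
determine the goodness of $Q \in \mathscr{D}$, are kept fixed, the result also remains true if
$Q \in \mathscr{D}$ is replaced by $Q \in \mathscr{D}_{\mathrm{good}}$ in both inner sums; this amounts to replacing
$\phi(Q, R)$ by $1_{\mathrm{good}}(Q)\phi(Q, R)$, which is just another function of $Q$ and $R$.

Proof. Observe that, by reasons of symmetry, the probability of goodness

\[
\pi_{\mathrm{good}} := P_{\beta,\tilde\beta}(R \in \mathscr{D}'_{\mathrm{good}}) = E_{\beta,\tilde\beta} 1_{\mathrm{good}}(R)
\]

is independent of the particular $R \in \mathscr{D}'$. Using the parameterization in terms of $\mathscr{D}^0$,
so that there is no randomness in the summation variable, one may compute

\[
\pi_{\mathrm{good}} E_\beta \sum_{R\in\mathscr{D}'} \sum_{\substack{Q\in\mathscr{D} \\ \ell(Q)\le\ell(R)}} \phi(Q,R)
= \sum_{R\in\mathscr{D}'} \sum_{\substack{Q\in\mathscr{D}^0 \\ \ell(Q)\le\ell(R)}} E_{\beta,\tilde\beta}\big[ 1_{\mathrm{good}}(R;\beta,\tilde\beta) \big]\, E_\beta \phi(Q+\beta, R),
\]

where the dependence of the goodness of $R$ on $\beta$ and $\tilde\beta$ has been indicated explicitly
for clarity. Here $Q + \beta$, and hence $\phi(Q + \beta, R)$, depends only on $\beta_j$ with $2^j < \ell(Q)$,
whereas the goodness of $R$ depends on $\tilde\beta$ and $\beta_j$ for $2^j \ge \ell(R) \ge \ell(Q)$. Thus, by
the product rule of expectations of independent quantities, the computation may
be continued with

\[
= E_{\beta,\tilde\beta} \sum_{R\in\mathscr{D}'_{\mathrm{good}}} \sum_{\substack{Q\in\mathscr{D} \\ \ell(Q)\le\ell(R)}} \phi(Q,R),
\]

and this is what was claimed. □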

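The modified notion of good and bad is still reasonably close to the original one,
in the following sense:

Lemma 5.3. If $Q \in \mathscr{D}_{\mathrm{good}}$ and $R \in \mathscr{D}'$ with $\ell(R) \ge 2^r\ell(Q)$, then

\[
\operatorname{dist}(Q, \partial R) > \tfrac12 \ell(Q)^\gamma \ell(R)^{1-\gamma}.
\]

Proof. Under the assumptions, the cube $\tilde{R} := R + \sum_{2^j<\ell(Q)} 2^j(\tilde\beta'_j - \beta'_j)$ belongs to
$\mathscr{D}'(Q)$ and hence $\operatorname{dist}(Q, \partial\tilde{R}) > \ell(Q)^\gamma \ell(R)^{1-\gamma}$. But then, using (5.1),

\[
\operatorname{dist}(Q, \partial R) \ge \operatorname{dist}(Q, \partial\tilde{R}) - \ell(Q)
> \ell(Q)^\gamma \ell(R)^{1-\gamma} \big( 1 - 2^{-(1-\gamma)r} \big) \ge \tfrac12 \ell(Q)^\gamma \ell(R)^{1-\gamma}. \qquad □
\]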

The main point of considering random dyadic systems is to be able to quantify 
the sense in which the bad cubes are rare. 

Lemma 5.4. Let $\mathscr{D}$ and $Q \in \mathscr{D} = \mathscr{D}^\beta$ be fixed, and choose $\beta'$, $\tilde\beta'$ randomly. Then

\[
P_{\beta',\tilde\beta'}(Q \in \mathscr{D}_{\mathrm{bad}}) \le 2N \frac{2^{-r\gamma}}{1 - 2^{-\gamma}}.
\]

Note that the right side can be made smaller than any preassigned $\epsilon > 0$ with a
sufficiently large choice of $r \in \mathbb{Z}_+$.

Proof. This is [25], Lemma 9.2. The fact that the badness is defined with respect
to $\mathscr{D}'(Q)$ rather than $\mathscr{D}'$ does not change the corresponding probability, since both
these dyadic systems are identically distributed. □
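
To get a feeling for how large $r$ must be, the bound can simply be evaluated. The sketch below assumes that the right side of Lemma 5.4 reads $2N \cdot 2^{-r\gamma}/(1 - 2^{-\gamma})$ with $\gamma = \alpha/(2(\alpha+d))$; the function names and the sample values $N = 2$, $\alpha = d = 1$, $\epsilon = 0.1$ are hypothetical and chosen only for illustration.

```python
def gamma_exponent(alpha, d):
    """gamma = alpha / (2 (alpha + d)), as in the definition of singular pairs."""
    return alpha / (2.0 * (alpha + d))


def bad_probability_bound(n_dim, gam, r):
    """Assumed form of the bound: 2N * 2^(-r*gamma) / (1 - 2^(-gamma))."""
    return 2 * n_dim * 2.0 ** (-r * gam) / (1.0 - 2.0 ** (-gam))


def smallest_r(n_dim, gam, eps):
    """Smallest positive integer r for which the bound falls below eps."""
    r = 1
    while bad_probability_bound(n_dim, gam, r) >= eps:
        r += 1
    return r


# Hypothetical sample parameters.
gam = gamma_exponent(alpha=1.0, d=1.0)
r_needed = smallest_r(n_dim=2, gam=gam, eps=0.1)
print(gam, r_needed, bad_probability_bound(2, gam, r_needed))
```

Since the bound decays only like $2^{-r\gamma}$ and $\gamma \le 1/2$, fairly large values of $r$ may be needed for small $\epsilon$; this is harmless because the implicit constants are allowed to depend on $r$.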

Given $Q \in \mathscr{D}$ and $n \in \mathbb{Z}_+$, the expression $Q^{(n)}$ denotes the dyadic ancestor
of $Q$ of the $n$th generation, i.e., it is the unique cube such that $Q \subseteq Q^{(n)} \in \mathscr{D}$
and $\ell(Q^{(n)}) = 2^n\ell(Q)$. For indicating the appropriate ancestor in a number of
arguments below, it is convenient to introduce the following integer-valued function:
for $j = 0, 1, 2, \dots$, let

\[
\theta(j) := \Big\lceil \frac{\gamma j + r}{1 - \gamma} \Big\rceil,
\]

where $\lceil x \rceil$ is the first (i.e., smallest) integer bigger than or equal to $x$.
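
The two properties of this function that are used later can be checked mechanically. The sketch below assumes the form $\theta(j) = \lceil (\gamma j + r)/(1-\gamma) \rceil$, which is consistent with the estimates of Section 7, and verifies with exact rational arithmetic that $(j + \theta(j))(1-\gamma) \ge j + r$ and that $d\,\theta(j) \le j\alpha/2 + C$ with $C = d\,(r/(1-\gamma) + 1)$. The sample values $\alpha = 1$, $d = 2$, $r = 5$ are hypothetical.

```python
import math
from fractions import Fraction


def theta(j, gam, r):
    """Assumed definition: smallest integer >= (gamma*j + r) / (1 - gamma)."""
    return math.ceil((gam * j + r) / (1 - gam))


# Hypothetical sample parameters; gamma = alpha / (2 (alpha + d)).
alpha, d, r = Fraction(1), Fraction(2), 5
gam = alpha / (2 * (alpha + d))

# Constant C in the growth bound d * theta(j) <= j * alpha / 2 + C.
C = d * (r / (1 - gam) + 1)

checks = []
for j in range(100):
    t = theta(j, gam, r)
    lower_ok = (j + t) * (1 - gam) >= j + r      # used for the contradiction argument
    growth_ok = d * t <= j * alpha / 2 + C       # used to bound mu(S)
    checks.append(lower_ok and growth_ok)
print(all(checks))
```

The first inequality is immediate from the ceiling, and the second uses $\gamma/(1-\gamma) = \alpha/(\alpha + 2d)$, so that $d\gamma/(1-\gamma) \le \alpha/2$.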
6. The tangent martingale trick

In this section I present the central tool for estimating the action on vector-valued
random sums of various averaging-type integral operators, which will be
encountered in the sequel. While the applications in the present paper will all
be in the context of $\mathbb{R}^N$ and its dyadic cubes, I decided to highlight the abstract
nature of this argument by giving the result in a general $\sigma$-finite measure space
$(E, \mathscr{M}, \mu)$ having a refining sequence of partitions as follows: For each $k \in \mathbb{Z}$,
let $\mathscr{A}_k$ be a countable partition of $E$ into sets of finite positive measure so that
$\sigma(\mathscr{A}_k) \subseteq \sigma(\mathscr{A}_{k-1})$, and let $\mathscr{A} := \bigcup_{k\in\mathbb{Z}} \mathscr{A}_k$.

The basic idea of the tangent martingale trick is the following: Given functions
$f_A$ supported by the atoms $A$, and of such a form that $f_A$ is $\sigma(\mathscr{A}_{k-1})$-measurable
whenever $A \in \mathscr{A}_k$, these will be replaced by new functions, which have a simpler
dependence on the variable $x \in E$, being just multiples of the indicator $1_A$, but
they still contain all the original information, which is hidden in the dependence
on a new variable $y$.

To make the "replacement" precise, for each $A \in \mathscr{A}$, let $\nu_A$ denote the probability
measure $\mu(A)^{-1} \cdot \mu|_A$. Let $(F, \mathscr{N}, \nu)$ be the space $\prod_{A\in\mathscr{A}} A$ with the product
$\sigma$-algebra and measure. Its points will be denoted by $y = (y_A)_{A\in\mathscr{A}}$. Then the
following norm equivalence holds:

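Theorem 6.1. If $X$ is a UMD space and $p \in (1,\infty)$, then

\[
\iint_{\Omega\times E} \Big| \sum_{k\in\mathbb{Z}} \varepsilon_k \sum_{A\in\mathscr{A}_k} f_A(x) \Big|_X^p \,dP(\varepsilon)\,d\mu(x)
\eqsim \iiint_{\Omega\times E\times F} \Big| \sum_{k\in\mathbb{Z}} \varepsilon_k \sum_{A\in\mathscr{A}_k} 1_A(x) f_A(y_A) \Big|_X^p \,dP(\varepsilon)\,d\mu(x)\,d\nu(y). \tag{6.2}
\]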



Proof. This is a version of McConnell's [20], Theorem 2.2, for tangent martingale
difference sequences. Consider the space $\Omega \times E \times F$, and identify any function,
subset or collection of sets on one of the components with a similar object lifted to
the product space in the usual way; so for example $\mathscr{A}_k$ is identified with $\Omega \times \mathscr{A}_k \times F$.
Then consider the functions

\[
d_k(\varepsilon,x,y) := \varepsilon_k \sum_{A\in\mathscr{A}_k} f_A(x), \qquad
e_k(\varepsilon,x,y) := \varepsilon_k \sum_{A\in\mathscr{A}_k} 1_A(x) f_A(y_A),
\]

and the $\sigma$-algebras $\mathscr{F}_k := \sigma(\{\varepsilon_j, y_A : A \in \mathscr{A}_j, j \ge k\}, \mathscr{A}_{k-1})$. Then both $d_k$ and $e_k$
are $\mathscr{F}_k$-measurable and, because of the $\varepsilon_k$ factor,

\[
E[d_k|\mathscr{F}_{k+1}] = E[e_k|\mathscr{F}_{k+1}] = 0,
\]

i.e., they form martingale difference sequences. Moreover, they satisfy the following
tangent property: their conditional distributions on $\mathscr{F}_{k+1}$ coincide, i.e.,

\[
E[1_{\{d_k\in D\}}|\mathscr{F}_{k+1}] = E[1_{\{e_k\in D\}}|\mathscr{F}_{k+1}]
\]

for all Borel sets $D \subseteq X$. In fact, computing the conditional expectations is easy in
the present case, since this amounts to fixing the variables $\varepsilon_j$ and $y_A$, for $A \in \mathscr{A}_j$
and $j > k$ (they do not appear in $d_k$ nor $e_k$, so this amounts to nothing), and
computing the average over $x \in A$ for every $A \in \mathscr{A}_k$. But

\[
\begin{aligned}
(P \otimes \mu \otimes \nu)(\{d_k \in D\} \cap \{x \in A\}) &= (P \otimes \mu)(\{\varepsilon_k f_A(x) \in D\}), \\
(P \otimes \mu \otimes \nu)(\{e_k \in D\} \cap \{x \in A\}) &= (P \otimes \nu)(\{\varepsilon_k f_A(y_A) \in D\}) \cdot \mu(A),
\end{aligned}
\]

which obviously coincide by the definition of $\nu$.

Hence McConnell's inequality applies. To be precise, he only formulates it in
the case of a finite (probability) measure space, rather than a $\sigma$-finite one, but one
immediately checks that his argument works in the present context as well. □



The main application of the theorem will be via the following consequence, where 
the auxiliary measure space F has disappeared: 

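Corollary 6.3. Let $X$ be a UMD space and $p \in (1,\infty)$. For each $A \in \mathscr{A}$, let
$k_A : A \times A \to \mathbb{C}$ be a jointly measurable function pointwise bounded by 1. Then

\[
\iint_{\Omega\times E} \Big| \sum_{k\in\mathbb{Z}} \varepsilon_k \sum_{A\in\mathscr{A}_k} \frac{1_A(x)}{\mu(A)} \int_A k_A(x,z) f_A(z) \,d\mu(z) \Big|_X^p \,dP(\varepsilon)\,d\mu(x)
\lesssim \iint_{\Omega\times E} \Big| \sum_{k\in\mathbb{Z}} \varepsilon_k \sum_{A\in\mathscr{A}_k} f_A(x) \Big|_X^p \,dP(\varepsilon)\,d\mu(x). \tag{6.4}
\]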



Proof. Consider the uniformly bounded sequence of functions

\[
K_k(x,y) := \sum_{A\in\mathscr{A}_k} 1_A(x) k_A(x, y_A)
\]

on $E \times F$. By the contraction principle, the right (and hence by Theorem 6.1 the
left) side of (6.2) dominates the expression

\[
\iiint_{\Omega\times E\times F} \Big| \sum_{k\in\mathbb{Z}} \varepsilon_k \sum_{A\in\mathscr{A}_k} 1_A(x) k_A(x,y_A) f_A(y_A) \Big|_X^p \,dP(\varepsilon)\,d\mu(x)\,d\nu(y),
\]

which in turn dominates

\[
\iint_{\Omega\times E} \Big| \sum_{k\in\mathbb{Z}} \varepsilon_k \sum_{A\in\mathscr{A}_k} 1_A(x) \int_F k_A(x,y_A) f_A(y_A) \,d\nu(y) \Big|_X^p \,dP(\varepsilon)\,d\mu(x)
\]

by Jensen's inequality. Since the innermost integrand only depends on the
coordinate $y_A$ of $y$, the integration over $F$ with respect to $d\nu(y)$ may be replaced by
integration over $A$ with respect to $d\nu_A(y_A) = \mu(A)^{-1} \,d\mu(y_A)$, which completes the
proof. □



It might be interesting to note an alternative approach to this result under 
additional structure on the space X: 

Proposition 6.5. Let $X$ be a UMD function lattice, and $p \in (1,\infty)$. Then the
conclusion of Corollary 6.3 holds even without requiring the $f_A$ to be $\sigma(\mathscr{A}_{k-1})$-measurable
for $A \in \mathscr{A}_k$.

Proof. The main difference compared to a general UMD space is the existence of
an absolute value $|\xi| \in X$ for each element $\xi \in X$. This will be exploited via the
fact that

\[
\int_\Omega \Big| \sum_k \varepsilon_k\xi_k \Big|_X^p \,dP(\varepsilon)
\eqsim \Big| \Big( \sum_k |\xi_k|^2 \Big)^{1/2} \Big|_X^p
\eqsim \int_\Omega \Big| \sum_k \varepsilon_k|\xi_k| \Big|_X^p \,dP(\varepsilon).
\]

Hence, taking into account the bound $|k_A(x,z)| \le 1$,



LHS^ 



fAiz)\dfliz) dP(£)d^(x) 



X 



< 



nxE 



where the second to last step was an application of the vector-valued Stein inequality
due to Bourgain [2], and throughout it was used that the functions $f_A$, where
$A \in \mathscr{A}_k$ for a fixed $k$, are disjointly supported. □

7. Separated cubes 

This section begins the lengthy task of estimating various subseries of the
expansion (2.3). The numbers $\epsilon$ and $m$ will be considered fixed, and the summation ranges
of dyadic cubes are restricted to the side-lengths $\epsilon \le \ell(Q), \ell(R) \le 2^m$ without
indicating this explicitly; it is important that the obtained estimates are uniform in
these parameters, and the convention concerning implicit constants, as formulated
in Notation 1.5, will be heavily employed. In addition to the parameters listed there,
the implicit constants are also allowed to depend on the auxiliary parameter $r \in \mathbb{Z}_+$
from the definition of good and bad dyadic cubes in Section 5. This convention will
be in force until further notice.

In this section, the binary sequences $\beta$, $\beta'$, $\tilde\beta$, $\tilde\beta'$ parameterizing the dyadic
systems are held fixed, and one deals with the part of (2.3) where a smaller cube
$Q \in \mathscr{D}$ is separated from the larger $R \in \mathscr{D}'$ by at least its own side-length,
$\operatorname{dist}(Q,R) \ge \ell(Q)$. By symmetry of the assumptions, the same conclusion will
follow for the part of the series with the roles of $Q$ and $R$ interchanged. It will also
be assumed that all "Haar" functions $\varphi_Q$ related to the smaller cube $Q$ are
cancellative ones; in any case, the contrary could only happen when $\ell(Q) = \ell(R) = 2^m$,
and the boundedly many pairs of cubes like this will be treated as close-by cubes
of comparable size in Section 10.

Also, one restricts the summations to the good cubes $Q \in \mathscr{D}$ and $R \in \mathscr{D}'$ only.
This is equivalent to replacing $f$ and $g$ by their good parts



good 



E 

Qe®goc 



eiQ)<2^ f(Q)=2'" 

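and $g_{\mathrm{good}}$, which is defined similarly. The present aim is then to prove that

\[
\Big| \sum_{R\in\mathscr{D}'_{\mathrm{good}}} \sum_{\substack{Q\in\mathscr{D}_{\mathrm{good}} \\ \ell(Q)\le\operatorname{dist}(Q,R)\wedge\ell(R)}} \langle g, \psi_R \rangle T_{RQ} \langle \varphi_Q, f \rangle \Big|
\lesssim \|g\|_{L^{p'}(\mu;X^*)} \|f\|_{L^p(\mu;X)}, \tag{7.1}
\]

where, recall, $T_{RQ} := \langle \psi_R b_2, T(b_1\varphi_Q) \rangle$.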

Lemma 7.2. Let $\ell(Q) \le \ell(R) \wedge \operatorname{dist}(Q, R)$. Then

\[
|T_{RQ}| \lesssim \frac{\ell(Q)^\alpha}{\operatorname{dist}(Q,R)^{d+\alpha}} \|\psi_R\|_{L^1(\mu)} \|\varphi_Q\|_{L^1(\mu)}.
\]

Proof. This is essentially [25], Lemma 6.1 (but leaving out the last line of the proof,
where the $\|\varphi_Q\|_{L^1(\mu)}$ was dominated by $\mu(Q)^{1/2}\|\varphi_Q\|_{L^2(\mu)}$, and similarly with $\psi_R$):
writing $y_0$ for the centre of $Q$ and recalling that $\int b_1\varphi_Q \,d\mu = 0$, there holds

\[
|T_{RQ}| = \Big| \iint \psi_R(x) b_2(x) [K(x,y) - K(x,y_0)] b_1(y) \varphi_Q(y) \,d\mu(y)\,d\mu(x) \Big|
\]

and the required estimate follows from (1.2). □

For the next estimate, define, as in [25], the long distance of two cubes

\[
D(Q, R) := \ell(Q) + \operatorname{dist}(Q, R) + \ell(R).
\]

Lemma 7.3. Let $Q \in \mathscr{D}_{\mathrm{good}}$ and $R \in \mathscr{D}'$ be as in Lemma 7.2. Then

\[
|T_{RQ}| \lesssim \frac{\ell(Q)^{\alpha/2} \ell(R)^{\alpha/2}}{D(Q,R)^{d+\alpha}} \|\psi_R\|_{L^1(\mu)} \|\varphi_Q\|_{L^1(\mu)}.
\]

Proof. This repeats [25], Lemma 6.4. □

To prove (7.1), consider first the part of the series where the ratio $\ell(R)/\ell(Q)$ is a
fixed number $2^n$ with $n \in \mathbb{N}$, and also $2^j \le D(Q, R)/\ell(R) < 2^{j+1}$ for a momentarily
fixed $j \in \mathbb{N}$. The last double inequality will be abbreviated as $D(Q, R)/\ell(R) \sim 2^j$.
If moreover $R \in \mathscr{D}'_k$, the estimate of Lemma 7.3 reads

\[
\frac{|T_{RQ}|}{\|\psi_R\|_1 \|\varphi_Q\|_1} \lesssim \frac{2^{(k-n)\alpha/2}\, 2^{k\alpha/2}}{2^{(k+j)(d+\alpha)}} = 2^{-n\alpha/2}\, 2^{-j\alpha}\, 2^{-(k+j)d}. \tag{7.4}
\]

In the following calculations, the summation condition $\operatorname{dist}(Q, R) \ge \ell(Q)$ is always
in force although it will not be indicated explicitly.
From (1231). it follows that 



D(Q,R)/e(H)r^2' 

II V- V- ^ 



Lp(P(8/j;J>(:) 



D(Q,fl)/^(_R)~2J 

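Next, observe that all cubes $R \in \mathscr{D}'_{\mathrm{good}}$ with $\ell(R) = 2^n\ell(Q)$ and $D(R, Q) \le
2^{j+1}\ell(R)$ satisfy

\[
R \subset Q^{(n+j+\theta(j))}.
\]

Indeed, if not, then a contradiction results from (5.1) and

\[
\begin{aligned}
2^{j+1}\ell(R) \ge D(R,Q) \ge \operatorname{dist}(R,Q) &\ge \operatorname{dist}(R, Q^{(n+j+\theta(j))}) \\
&> \tfrac12 \ell(R)^\gamma \ell(Q^{(n+j+\theta(j))})^{1-\gamma} = \tfrac12 \ell(R)^\gamma (2^{j+\theta(j)}\ell(R))^{1-\gamma} \\
&\ge \tfrac12 2^{j(1-\gamma)+(\gamma j+r)} \ell(R) = 2^{j+r-1}\ell(R).
\end{aligned}
\]

Hence the summation over $R$ may be reorganized as

\[
\sum_{R\in\mathscr{D}'_{\mathrm{good},k}} = \sum_{S\in\mathscr{D}_{k+j+\theta(j)}} \sum_{\substack{R\in\mathscr{D}'_{\mathrm{good},k} \\ R\subset S}}.
\]

For $Q$, $R$, $S$ as in the above sums, denote

\[
T_{RQ} =: 2^{-n\alpha/2}\, 2^{-j\alpha/2}\, \frac{\|\psi_R\|_1 \|\varphi_Q\|_1}{\mu(S)}\, t_{RQ},
\]

where $|t_{RQ}| \lesssim 1$ by $\mu(S) \lesssim 2^{(k+j+\theta(j))d} \lesssim 2^{(k+j)d + j\alpha/2}$ and (7.4).

For each $S \in \mathscr{D}_{k+j+\theta(j)}$, define the kernel

\[
K_S(x,y) := \sum_{\substack{R\in\mathscr{D}'_{\mathrm{good},k} \\ R\subset S}} \sum_{\substack{Q\in\mathscr{D}_{\mathrm{good},k-n} \\ D(Q,R)/\ell(R)\sim 2^j}} \psi_R(x) \|\psi_R\|_1\, t_{RQ}\, \|\varphi_Q\|_1 \varphi_Q(y) b_1(y).
\]

Then $K_S$ is supported on $S \times S$ and $|K_S(x,y)| \lesssim 1$, since $\|\varphi_Q\|_\infty \|\varphi_Q\|_1 \lesssim 1$, and
the same with $\psi_R$, and since there is at most one non-zero term in the double sum
for any given pair of points $(x, y)$. The quantity inside the $L^p(P \otimes \mu; X)$-norm in
(7.5) is $2^{-(n+j)\alpha/2}$ times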

n+j+e{j) 1 / ^ /• 1 iri)''! f 

E E - E ^//■»(-)^(^)^- 

feo=o kez-k=ko se&k+j+BU) ^ -"^ 

mod n+j+6l(j) + l 

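where the fact that $\langle \varphi_Q, f \rangle = \langle \varphi_Q, D^{b_1}_{k-n} f \rangle$ for $Q \in \mathscr{D}_{k-n}$ was also used.

For a fixed $k_0$, the series over $k \equiv k_0 \bmod n + j + \theta(j) + 1$ above is exactly of the
form considered in Corollary 6.3: $1_S b_1^{-1} D^{b_1}_{k-n} f$ is supported on $S \in \mathscr{D}_{k+j+\theta(j)}$, and it
is constant on every cube $Q' \in \mathscr{D}_{k-n-1} = \mathscr{D}_{k'+j+\theta(j)}$, where $k' = k - (n+j+\theta(j)+1)$.
Hence the $L^p(P \otimes \mu; X)$-norm of this series is dominated by

\[
\Big\| \sum_{k\equiv k_0} \varepsilon_k \sum_{S\in\mathscr{D}_{k+j+\theta(j)}} 1_S b_1^{-1} D^{b_1}_{k-n} f \Big\|_{L^p(P\otimes\mu;X)} \lesssim \|f\|_{L^p(\mu;X)},
\]

using Corollary 6.3, para-accretivity of $b_1$, and the unconditional convergence of
the twisted martingale differences.

The full series over $k \in \mathbb{Z}$ consists of $n + j + \theta(j) + 1 \lesssim n + j + 1$ subseries like
this, which implies that the quantity in (7.5) is dominated by

\[
(n + j + 1)\, 2^{-(n+j)\alpha/2} \|g\|_{L^{p'}(\mu;X^*)} \|f\|_{L^p(\mu;X)}.
\]

Since this is summable over $n, j \in \mathbb{N}$, this proves the goal (7.1).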



8. Cubes well inside another cube



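This section addresses the part of the series (2.3), where a smaller cube $Q \in \mathscr{D}_{\mathrm{good}}$
is contained in a substantially larger cube $R \in \mathscr{D}'_{\mathrm{good}}$ with $\ell(R) > 2^r\ell(Q)$. (Again,
the symmetry of the assumptions allows one to deduce the same final result also for $Q$
and $R$ in opposite relative positions.) Hence, the relevant part of the series is

\[
\sum_{R\in\mathscr{D}'_{\mathrm{good}}} \sum_{\substack{Q\in\mathscr{D}_{\mathrm{good}},\, Q\subset R \\ \ell(Q)<2^{-r}\ell(R)}} \langle g, \psi_R \rangle T_{RQ} \langle \varphi_Q, f \rangle, \tag{8.1}
\]

where, as before, $T_{RQ} := \langle \psi_R b_2, T(b_1\varphi_Q) \rangle$. In this section, we deal with a
modification of this series, with $T_{RQ}$ replaced by

\[
\tilde{T}_{RQ} := T_{RQ} - \langle b_2, T(b_1\varphi_Q) \rangle \langle \psi_R \rangle_Q,
\]

postponing the treatment of the correction term. Hence the goal of this section is
reduced to proving that

\[
\Big| \sum_{R\in\mathscr{D}'_{\mathrm{good}}} \sum_{\substack{Q\in\mathscr{D}_{\mathrm{good}},\, Q\subset R \\ \ell(Q)<2^{-r}\ell(R)}} \langle g, \psi_R \rangle \tilde{T}_{RQ} \langle \varphi_Q, f \rangle \Big|
\lesssim \|g\|_{L^{p'}(\mu;X^*)} \|f\|_{L^p(\mu;X)}. \tag{8.2}
\]

This will follow a similar strategy as in Section 7, starting from the estimation of
the matrix elements $\tilde{T}_{RQ}$.

Lemma 8.3. Let $Q$ be good, $Q \subset R$, $\ell(Q) < 2^{-r}\ell(R)$, and let $S \in \mathscr{D}'$ be the
subcube of $R$ containing $Q$ with $\ell(S) = \ell(R)/2$. Then

\[
\tilde{T}_{RQ} = -\langle \psi_R \rangle_S \langle 1_{S^c} b_2, T(b_1\varphi_Q) \rangle + \sum_{\substack{S'\in\mathscr{D}';\, S'\subset R\setminus S \\ \ell(S')=\ell(R)/2}} \langle \psi_R 1_{S'} b_2, T(b_1\varphi_Q) \rangle,
\]

\[
|\tilde{T}_{RQ}| \lesssim \Big( \frac{\ell(Q)}{\ell(R)} \Big)^{\alpha/2} \Big( |\langle \psi_R \rangle_S| + \frac{\|\psi_R\|_{L^1(\mu)}}{\mu(R)} \Big) \|\varphi_Q\|_{L^1(\mu)}.
\]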

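Proof. Concerning the equality, the fact that $\psi_R$ is constant on the subcubes of $R$
implies that

$$\mathrm{LHS}=\sum_{\substack{S'\in\mathscr{D}';\ S'\subset R\\ \ell(S')=\ell(R)/2}}\langle\psi_R1_{S'}b_2,T(b_1\varphi_Q)\rangle-\langle\psi_R\rangle_S\langle b_2,T(b_1\varphi_Q)\rangle=\mathrm{RHS}.$$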

As for the upper bound of the first term, 



-di^t(5^"'^^"^- 



the first estimate is similar to Lemma 7.2 and the second follows by splitting the
integration into dyadic annuli $2^k<\operatorname{dist}(x,Q)/\operatorname{dist}(S^c,Q)\le 2^{k+1}$, $k\in\mathbb{N}$, and using

$$\mu(\{x:\operatorname{dist}(x,Q)\le 2^k\operatorname{dist}(S^c,Q)\})\lesssim(\ell(Q)+2^k\operatorname{dist}(S^c,Q))^d\lesssim 2^{kd}\operatorname{dist}(S^c,Q)^d.$$

The last bound was due to the goodness of $Q$, and for the same reason (and noting
that $\gamma\le\frac12$)

$$\operatorname{dist}(S^c,Q)\gtrsim\ell(Q)^\gamma\ell(S)^{1-\gamma}\gtrsim\ell(Q)^{1/2}\ell(R)^{1/2},$$

which concludes the estimation of the first term.
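
The dyadic-annuli step can be written out as follows. This is only a sketch, under the assumption that the quantity being integrated over $S^c$ is the usual Calderón-Zygmund tail $\ell(Q)^\alpha\operatorname{dist}(x,Q)^{-d-\alpha}$; the abbreviation $D:=\operatorname{dist}(S^c,Q)$ is introduced here for readability and is not notation from the text.

```latex
% Assumption of this sketch: the integrand is \ell(Q)^\alpha dist(x,Q)^{-d-\alpha};
% D := dist(S^c, Q) is shorthand introduced here.
\int_{S^c}\frac{\ell(Q)^\alpha}{\operatorname{dist}(x,Q)^{d+\alpha}}\,d\mu(x)
 \le \sum_{k=0}^{\infty}\frac{\ell(Q)^\alpha}{(2^kD)^{d+\alpha}}\,
      \mu\big(\{x:\operatorname{dist}(x,Q)\le 2^{k+1}D\}\big)
 \lesssim \sum_{k=0}^{\infty}\frac{\ell(Q)^\alpha}{(2^kD)^{d+\alpha}}\,2^{kd}D^d
 = \Big(\frac{\ell(Q)}{D}\Big)^{\alpha}\sum_{k=0}^{\infty}2^{-k\alpha}
 \lesssim \Big(\frac{\ell(Q)}{D}\Big)^{\alpha}.
% With D \gtrsim \ell(Q)^{1/2}\ell(R)^{1/2} this gives
\Big(\frac{\ell(Q)}{D}\Big)^{\alpha}\lesssim\Big(\frac{\ell(Q)}{\ell(R)}\Big)^{\alpha/2}.
```

The geometric series converges precisely because $\alpha>0$, and the lower bound on $D$ from goodness is what converts the decay in $D$ into the factor $(\ell(Q)/\ell(R))^{\alpha/2}$ of the lemma.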

For the second term one can apply Lemma 7.3 with $\psi_R1_{S'}$ in place of $\psi_R$, observing that nothing but the support and integrability properties of $\psi_R$ were used
in the proof. This gives

£(0)"/'^£(S')"/'^ 

\{^PRis'b2,T{bi^Q))\ < %[q syL ll^^llilly^glli 

- \£{S')J £{S'Y ' 

and the proof is concluded by noting that $\ell(S')=\ell(R)/2$, and hence $\ell(S')^d\gtrsim\mu(R)$. □

Lemma 8.4. Under the assumptions of Lemma 8.3,

Proof. For the second term in the estimate of $|\tilde T_{RQ}|$ in Lemma 8.3 this is clear.
For the first term, one has to look more carefully into the structure of the function
$\psi_R$, recalling that $\psi_R=\psi_{R,v}$ for some $v\in\{1,\dots,2^N\}$. Let $S=R_w$.
If $v=w$, then for $x\in R_w$,

\^Rix)\^\{^PR)RJ<^iiR^r'/^ 

so that \iIjr{x){^b.)r,, \ ^ f^{Rw)^^ , whereas for x G R\Rw, 



li^Rix)] < 7^^' \'^Rix){i'R)R^ 



If $v\neq w$, then for all $x\in R$

KRvf'^ 1 _ 1 



\{'^r)rJ ■ HrWc 



< 



(8.5) 



LP(P(8/j;X) 



which is even slightly better than the worst case scenario $v=w$. □

To prove (8.2), consider the part of the sum with $w\in\{1,\dots,2^N\}$ fixed and
$Q\subset R_w$. Let further $n\in\{r+1,r+2,\dots\}$ be fixed, and $\ell(Q)=2^{-n}\ell(R)$.
By (2.5), one gets

I XI 51 51 {g,i'R)TRQ{ipQj) 

^ 11.911 II E E E v-flT^^gi^Q,/) 

QcRu, 

For each fc G Z and R & &'f. define the kernels 

:=2""/2 E KR)^R\Rjx)^Rix)TRQiPQ{y)h{y), 
QC-R™ 

Then if™* is supported in R x R and ii')^ in i?^, x R^, and they satisfy 

ii^nioo + iiifi?iioo < 1

by Lemma 8.4. Moreover,

E^'^ E E V'i?(a;)ri?Q((^Q,/) 

QdRu, 

= 2—/^;^,, E ^ / ^r(-,y)^^^(2/)dM.) (8.6) 

The aim is again to apply Corollary 6.3. However, there is an initial obstruction
(which I had originally overlooked, and which was kindly pointed out to me by Antti
Vähäkangas): the two kernel functions just defined are adapted to the dyadic cubes
$R\in\mathscr{D}'_k$ and $R_w\in\mathscr{D}'_{k-1}$, respectively, whereas $D^{b_1}_{k-n}f/b_1$ is constant on the cubes $Q\in\mathscr{D}_{k-n-1}$,
and this dyadic system is not a refinement of the other one. The way out is to
define somewhat finer partitions of $\mathbb{R}^N$ by



so that the generated cr-algebras cr((o'^) = S^k+r+i) again form a filtration. 

The important point is to observe that 



^fe,good c 4', {Ru, e -.R^cRe ^I.,good} c (8-7) 



Indeed, R G ^^^^ means in particular that R does not intersect the boundary 
of any Q G S^k+n and hence also i?^, C R cannot intersect the boundary of any 
Q G ^^(^k-i)+{r+i)- On the right of (j8.6p . one may hence replace the first summation 
condition i? G g^^^j by i? G simply setting K'^^ := for all the new sets R 
thus introduced. The second summation in (j8.6p could be equally well taken with 
respect to the summation variable R^, and (j8.7p allows to write the summation 
condition as R^ G again defining the newly introduced kernels K^^ := 0. 

Since B^^_„//6i is constant on the sets Q G it is a fortiori constant on 

the smaller sets S HQ & '^k-n-r-2- 

Splitting the $k$-series in (8.6) into $n+r+2$ subseries according to $k\equiv k_0 \mod n+r+2$, everything is now ready for the application of Corollary 6.3, just as in
Section 7. This provides the upper bound $C2^{-n\alpha/2}(n+r+2)$ for the $L^p(\mathbb{P}\otimes\mu;X)$-norm in (8.5), and it is possible to sum over $n\in\{r+1,r+2,\dots\}$ and $w\in\{1,\dots,2^N\}$
to conclude (8.2).

9. The correction term as a paraproduct

It is time to take up the consideration of the correction term subtracted at the 
beginning of the previous section, namely 

X! X! ^{9.i^R,v){i}R,v)Q{b2,T{hiipQ^y,)){'^Q,u,f) 

e{Q)<2-''e{R) 

- E ( E 

2''£(Q)<£(i?)<2'" 

+ «^5/62)Q)(r*52,P^QV), 
2'-^(Q)<^(_R)=2'" 

where the previously suppressed variables $u,v$, as well as the implicit restriction
to side-lengths at most $2^m$, have been momentarily taken back into consideration.
Were it not for the restriction to the good cubes $R$ only, recalling the definition
of $D^{b_2}_k$ as a difference of two $E^{b_2}_k$'s, the inner summation on the right would be a
telescopic one, collapsing to



(E5j^5/62)q - {9)R/{b2)R, 

where $R\in\mathscr{D}'$ is the unique cube such that $\ell(R)=2^r\ell(Q)$ and $R\supset Q$.
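
The telescoping mechanism can be illustrated in the plain (untwisted) setting. The following is only a sketch under stated assumptions: $E_kg$ denotes the average of $g$ over the dyadic cube of side length $2^k$ containing the point in question, the convention $D_k:=E_{k-1}-E_k$ is assumed (with the opposite convention all signs flip), and $j$ is defined by $2^j=2^r\ell(Q)$. The operators in the text are the $b_2$-twisted analogues, so this shows the mechanism rather than the exact identity.

```latex
% Assumed convention for this sketch: D_k := E_{k-1} - E_k, cubes of level k have side 2^k.
\sum_{k=j+1}^{m} D_k g
  = \sum_{k=j+1}^{m}\big(E_{k-1}g - E_kg\big)
  = E_jg - E_mg .
% On Q, E_j g is the average of g over the ancestor of side length 2^r \ell(Q),
% and E_m g is the average over the ancestor of side length 2^m.
```

Only the two endpoint averages survive, which is why the restriction to good cubes $R$, removing some intermediate terms, is the sole obstruction to the collapse.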

It is here that, for the first time, the random choice of the dyadic systems comes to
the rescue. The double sum on the right of (9.1) is of the form considered in Lemma 5.2,
which states that, on average, the restriction to good cubes is irrelevant as far as
the bigger cubes in such a sum are concerned. More precisely, using the observation
concerning the telescopic series, it follows that

E^^sdllD = ^goodE^ J2 E #y^(r*62,e^V), 

Qe®good R&S>',RDQ ^ 
l{R)=2''e{Q) 

where the result was intentionally written as a double sum, even though the cube
$R$ is uniquely determined by $Q$. This was done in order to realize that Lemma 5.2
may be applied again, leading back to the restriction to good $R$ only, but now in
the collapsed series obtained:

E,^ra=E,^ Y: E ^(T*b2,B'-f) 

e{R)=2'-e{Q) 

= % E E T.7i^(T*b2MVQ,u){^Q,uJ). 

e{Q)=2-'-e(R) 

Now that this computation is done, the summation over $u$ is once again suppressed, one forgets about the average $\mathbb{E}$ over the random parameters, and turns to considering the above
expression for an arbitrary but fixed choice of the $\beta$ parameters. The above sum
will be interpreted as a pairing $\langle\Pi_2g,f\rangle$, where

$$\Pi_2g:=\sum_{R\in\mathscr{D}'_{\mathrm{good}}}\ \sum_{\substack{Q\in\mathscr{D}_{\mathrm{good}},\ Q\subset R\\ \ell(Q)=2^{-r}\ell(R)}}\frac{\langle g\rangle_R}{\langle b_2\rangle_R}\langle T^*b_2,b_1\varphi_Q\rangle\varphi_Q \tag{9.2}$$

is the paraproduct, similar to the one in [25], Section 7.1, but also different in
certain respects. Nazarov et al. allowed somewhat more summands on the right
by imposing only the condition $\operatorname{dist}(Q,R^c)\ge\lambda\ell(Q)$ for the two cubes $Q\in\mathscr{D}$ and
$R\in\mathscr{D}'$ with side-lengths related as above. This is a consequence of $Q\in\mathscr{D}_{\mathrm{good}}$,
which implies that dist(Q,9ii) > ^e{Q)'^£{R)^-^ > \ ■ 4Xe{Q), and the difference 
between these two conditions is inessential for the present considerations. 

More important, however, is the fact that one can also restrict the bigger cube
$R$ to be a good one here. This condition was absent from the paraproduct treated
in earlier versions of this paper, which I only managed to do with the help of
Theorem 3.5 and the additional assumption of the RMF property of the Banach
space $X^*$. I still do not know whether the variant of the paraproduct with $R\in\mathscr{D}'$
could be treated without the RMF condition, but the reduction above shows that
one only needs to consider $\Pi_2$ as in (9.2).

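This paraproduct is related to, and should be controlled in terms of, the BMO
function $T^*b_2$. The membership in BMO will be exploited via the following estimate:

Lemma 9.3. For $p\in(1,\infty)$ and $h\in\mathrm{BMO}^p_\lambda(\mu)$,

$$\Big\|\sum_{\substack{Q\in\mathscr{D}_{\mathrm{good}};\ Q\subset R\\ \ell(Q)\le 2^{-r}\ell(R)}}\varepsilon_Q\langle h,b_1\varphi_Q\rangle\varphi_Q\Big\|_{L^p(\mathbb{P}\otimes\mu)}\lesssim\mu(R)^{1/p}\|h\|_{\mathrm{BMO}^p_\lambda(\mu)}.$$

Proof. Consider the Whitney-type covering $\mathscr{W}$ of $R$ consisting of the maximal
dyadic cubes $S\in\mathscr{D}$ subject to the conditions $\ell(S)\le 2^{-r}\ell(R)$ and

$$\operatorname{dist}(S,R^c)\ge\lambda\ell(S). \tag{9.4}$$

Then the expanded cubes $\lambda S$ satisfy the bounded overlapping property

$$\sum_{S\in\mathscr{W}}1_{\lambda S}\le C1_R.$$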

If Q is one of the cubes appearing in the sum on the left of the assertion, the 
goodness of Q implies that 

dist(g,i?^) > ^£{Qy£{Ry-' > 2-12'-(i-tV(Q) > Xe{Q), 



where the last estimate used (5.1). Hence $Q$ is contained in a maximal cube with
this property, i.e., in some $S\in\mathscr{W}$.

Without loss of generality, take $\|h\|_{\mathrm{BMO}^p_\lambda(\mu)}=1$. By the definition of $\mathrm{BMO}^p_\lambda(\mu)$
and the boundedness of $b_1$, there holds

$$\|1_S(h-\langle h\rangle_S)b_1\|^p_{L^p(\mu)}\lesssim\mu(\lambda S).$$

Consider the "Haar" coefficient $\langle 1_S(h-\langle h\rangle_S)b_1,\varphi_Q\rangle$. If $Q\subset S$, this coefficient
equals $\langle h,b_1\varphi_Q\rangle$, since $b_1\varphi_Q$ is supported on $Q\subset S$ and has a vanishing integral.
Hence it follows from unconditionality that



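$$\Big\|\sum_{\substack{Q\in\mathscr{D}_{\mathrm{good}}\\ Q\subset S}}\varepsilon_Q\langle h,b_1\varphi_Q\rangle\varphi_Q\Big\|^p_{L^p(\mathbb{P}\otimes\mu)}\lesssim\mu(\lambda S). \tag{9.5}$$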



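But the sum over $S\in\mathscr{W}$ of the left side of the previous estimate coincides with
the $p$th power of the left side of the assertion by the observation that every $Q$ there is contained in
exactly one $S\in\mathscr{W}$. On the other hand, by the bounded overlapping property, the sum over the right hand side is

$$\sum_{S\in\mathscr{W}}\mu(\lambda S)=\int\sum_{S\in\mathscr{W}}1_{\lambda S}\,d\mu\lesssim\mu(R),$$

and this completes the proof.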



□ 
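
The identification of the sum over the Whitney cubes with the left side of the assertion uses only that the cubes of the covering are pairwise disjoint, which is automatic for maximal dyadic cubes. A short sketch, where $F_S$ is shorthand introduced here for the part of the randomized sum carried by $S$:

```latex
% F_S := \sum_{Q \subset S} \varepsilon_Q \langle h, b_1\varphi_Q\rangle \varphi_Q
% (shorthand for this sketch; F_S is supported on S, and the S are pairwise disjoint)
\Big\|\sum_{S\in\mathscr{W}}F_S\Big\|_{L^p(\mathbb{P}\otimes\mu)}^p
  = \mathbb{E}\int\Big|\sum_{S\in\mathscr{W}}F_S\Big|^p\,d\mu
  = \sum_{S\in\mathscr{W}}\mathbb{E}\int_S|F_S|^p\,d\mu
  = \sum_{S\in\mathscr{W}}\|F_S\|_{L^p(\mathbb{P}\otimes\mu)}^p .
```

At each point at most one $F_S$ is non-zero, so the $p$th power of the sum equals the sum of the $p$th powers pointwise, and integrating gives the identity.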



Now everything has been prepared for the main result of this section: 
Theorem 9.6. Under the standing hypotheses, there holds

$$\|\Pi_2g\|_{p'}\lesssim\|T^*b_2\|_{\mathrm{BMO}^{p'}_\lambda(\mu)}\|g\|_{p'}\lesssim\|g\|_{p'}.$$

By the considerations at the beginning of the section, this shows that 



J2 E {9,i^R){i^R)Q{b2,Tih^Q)){VQJ) <\\9\\p'\ 



e(Q)<2-'-e(R) 



Proof. Denote for short $q:=p'$ and



E {b2,T{hnpQ))—— -ipQ 

QeSisaod;QcR ^ '^'"^ 

1{Q)=2-'- t{R) 



(9.7) 



Because of the unconditionality of the system {(^sglgggj, it follows that 



\\^2g\\Ll{t,;X*) < E ^R^Ri9)R 



L<J(P(8/i;X*) 



(9.8) 



and this is of the abstract paraproduct form (3.3), although not yet with the measurability condition required to apply Theorem 3.4. This deficit will be repaired
with the help of the refined filtrations as in the previous section, and a direct application of the tangent martingale Theorem 6.1 (as opposed to its indirect use via
Corollary 6.3 like in the last two sections).
Turning to the details, let



and 



where the containments are immediate from Lemma 5.3. A subsequence of the
corresponding $\sigma$-algebras form a filtration, as



(7{Sk+r) C Cr((ffc) C (T[S'k-

Defining $\Phi_R:=0$ whenever $R\in\mathscr{D}'_k\setminus\mathscr{D}'_{\mathrm{good},k}$, the estimate in (9.8) may be
continued with

k Res;

/co— k=kQ 

mod 2r+l 



L9(dP(6) d/t(x) di/(a):X*) 



where the measure v — Vko on the product space rifc=fcn Ilfle^' (with typical 
point denoted by y = (y7?)i?,) is defined as in Theorem 16.11 The last estimate was 
an application of the mentioned theorem to each of the subseries with a fixed fcoi 
observing that each is supported on i? G ff^, zero on Q e i^bad,fc-r and contant 
on Q G ^k-r-i when Q^^^ G ^^good, k-r', hence constant on5 G fSfc-2r-i- 
Now each 

ReSi 

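interpreted as a function of $x\in\mathbb{R}^N$ with values in $L^q(d\nu(y))$, is obviously measurable with respect to the $\sigma$-algebra of the refined partition with the same index $k$, so that Theorem 3.4 is applicable, taking $X_1=X^*$, $X_3=L^q(\nu;X^*)$
and $X_2=L^q(\nu)\subset\mathscr{L}(X_1,X_3)$ in a canonical way. It guarantees that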

\\^2g\\Li(KX') 

llffl!L<!(M;X-) II 



< 



, k=ko 



2^ ^R{yR)'^Rix) 



j=ko 
]<k 



R£S- 
RCS 



L<i{dV{s)dfi{x)di^{y)) 



(9.9) 



and another application of Theorem 6.1 permits the replacement of $\Phi_R(y_R)1_R(x)$ by
simply $\Phi_R(x)$. Also, the summation condition $R\in\mathscr{D}'_j$ may obviously be replaced
by $R\in\mathscr{D}'_{\mathrm{good},j}$, as the remaining terms are zero by definition. And then one
rearranges the summation in terms of the maximal good cubes $P$ appearing
in this summation. The additivity of the integral on disjointly supported functions
implies that


j = ko 
3<k 



p 



ReS- 
RCS 



E 

p 



E 

log2 l(R) = ko 

RCP 



Li(P0tj.) 



E 

QG@good; QCP 
^(Q)<2"''^(P) 
log2e{Q)=ko-r 



eQ{T*h2,bnpQ)ipQ 



L9(Pi8)/2) 



where the last line was essentially just writing out the definition of each $\Phi_R$. Now
Lemma 9.3 and the maximality of the cubes $P$, all of which are contained in $S$,
make the estimate continue with

$$\lesssim\sum_P\mu(P)\|T^*b_2\|^q_{\mathrm{BMO}^q_\lambda(\mu)}\le\mu(S)\|T^*b_2\|^q_{\mathrm{BMO}^q_\lambda(\mu)}.$$

The proof is completed by recalling from Nazarov et al. [25], Section 2.3, that
the assumption $\|T^*b_2\|_{\mathrm{BMO}^1_\lambda(\mu)}\le 1$, combined with the other hypotheses of Tb
theorem 1, already implies that $\|T^*b_2\|_{\mathrm{BMO}^q_\lambda(\mu)}\lesssim 1$ for all $q\in[1,\infty)$ (which would
not be true for an arbitrary $h\in\mathrm{BMO}^1_\lambda(\mu)$ in place of $T^*b_2$), and substituting back
to (9.9). □

10. Close-by cubes of comparable size 

The part of the series (2.3) which has not been addressed so far consists of the
pairs of good cubes $Q$, $R$ which are close to each other both in terms of their position
and size; more precisely, $2^{-r}\ell(R)\le\ell(Q)\le 2^r\ell(R)$ and $\operatorname{dist}(Q,R)<\ell(Q)\wedge\ell(R)$.
In this section, a certain portion, determined by a new auxiliary parameter $\eta$, of
this remaining part will be estimated, and accordingly, the implicit constants here
are allowed to depend on both $r$ and $\eta$, in addition to the parameters listed in
Notation 1.5. Only the size, and not the cancellation, properties of the "Haar"
functions will be exploited here, so the estimates are equally valid for both types
of functions, $\varphi_{Q,0}$ and $\varphi_{Q,u}$, $u\ge 1$, appearing in (2.3).

Given $R\in\mathscr{D}'_{\mathrm{good}}$, there are only boundedly many cubes $Q\in\mathscr{D}_{\mathrm{good}}$ like this, and
thus it remains to consider a finite number of subseries

$$\sum_{R\in\mathscr{D}'_{\mathrm{good}}}\langle g,\psi_R\rangle T_{RQ}\langle\varphi_Q,f\rangle=\sum_{R\in\mathscr{D}'_{\mathrm{good}}}\langle g,\psi_R\rangle\langle\psi_Rb_2,T(b_1\varphi_Q)\rangle\langle\varphi_Q,f\rangle, \tag{10.1}$$

where $Q=Q(R)$. Fix one such series; the convention that $Q$ is implicitly a function
of $R$ will be maintained without further notice throughout the rest of this section.
Without essential loss of generality, it is permissible to act as if the map $R\mapsto Q(R)$ was invertible, so that the same series (10.1) could also be written with the
summation variable $Q\in\mathscr{D}_{\mathrm{good}}$, with $R=R(Q)$. In reality, it may happen that
some $Q$ has no preimage $R$, or that there are several preimages. But in the first
case one may simply interpret the corresponding terms as zero, and in the second
case the number of preimages is nevertheless bounded, so that one can always split
the summations under consideration into boundedly many subseries and proceed
with the triangle inequality; such technical details will not be indicated explicitly.
Observing that

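$$b_1\varphi_Q\langle\varphi_Q,f\rangle=\sum_{\substack{Q'\in\mathscr{D},\ Q'\subset Q\\ \ell(Q')=\ell(Q)/2}}b_11_{Q'}\langle\varphi_Q\rangle_{Q'}\langle\varphi_Q,f\rangle=:\sum_{\substack{Q'\in\mathscr{D},\ Q'\subset Q\\ \ell(Q')=\ell(Q)/2}}b_11_{Q'}c_{Q'}(f)$$

and similarly

$$b_2\psi_R\langle\psi_R,g\rangle=:\sum_{\substack{R'\in\mathscr{D}',\ R'\subset R\\ \ell(R')=\ell(R)/2}}b_21_{R'}d_{R'}(g),$$

the series (10.1) splits into $(2^N)^2$ subseries of the form

$$\sum_Rd_R(g)\langle 1_Rb_2,T(b_11_Q)\rangle c_Q(f), \tag{10.2}$$

where $Q=Q(R)$ is a possibly different function of $R$ from the one before, but still
with the property that $2^{-r}\ell(R)\le\ell(Q)\le 2^r\ell(R)$.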

As in [25], for each cube $Q$, define the boundary region

$$\delta_Q:=(1+2\eta)Q\setminus(1-2\eta)Q, \tag{10.3}$$





where the new auxiliary parameter $\eta>0$ is to be chosen. Then, for each $Q\in\mathscr{D}$,
its bad part is defined by

$$Q_{\mathrm{bad}}:=Q\cap\bigcup_{\substack{R\in\mathscr{D}'\\ 2^{-r}\le\ell(R)/\ell(Q)\le 2^r}}\delta_R,$$



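while for $R\in\mathscr{D}'$ a similar definition with the obvious modification is made. (Note
that, just like in [25], this is a different badness from the one considered in the previous
sections: some cubes, as entities, are good while some are bad, but all cubes,
whether good or bad, have their bad part in the sense of the above definition.)
Given $R\in\mathscr{D}'$ and $Q=Q(R)\in\mathscr{D}$ appearing in the sum (10.2), let

$$\Delta:=Q\cap R,\qquad\begin{aligned}Q_{\mathrm{sep}}&:=Q\setminus\Delta\setminus\delta_R,&Q_\partial&:=(Q\setminus\Delta)\cap\delta_R\subset Q_{\mathrm{bad}},\\ R_{\mathrm{sep}}&:=R\setminus\Delta\setminus\delta_Q,&R_\partial&:=(R\setminus\Delta)\cap\delta_Q\subset R_{\mathrm{bad}},\end{aligned}$$

so that there are disjoint unions

$$Q=\Delta\cup Q_{\mathrm{sep}}\cup Q_\partial,\qquad R=\Delta\cup R_{\mathrm{sep}}\cup R_\partial.$$

Then the matrix coefficient in (10.2) can be written as

$$\begin{aligned}\langle 1_Rb_2,T(b_11_Q)\rangle&=\langle 1_{R_{\mathrm{sep}}}b_2,T(b_11_Q)\rangle+\langle 1_{R_\partial}b_2,T(b_11_Q)\rangle\\&\quad+\langle 1_\Delta b_2,T(b_11_\Delta)\rangle\\&\quad+\langle 1_\Delta b_2,T(b_11_{Q_\partial})\rangle+\langle 1_\Delta b_2,T(b_11_{Q_{\mathrm{sep}}})\rangle.\end{aligned}\tag{10.4}$$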
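
The five terms arise from splitting the two indicators along the disjoint unions above, one after the other. Spelled out as a sketch that uses only the bilinearity of the pairing, with $Q_{\mathrm{sep}}$, $Q_\partial$, $R_{\mathrm{sep}}$, $R_\partial$ denoting the separated and boundary pieces just defined:

```latex
1_R = 1_{R_{\mathrm{sep}}} + 1_{R_\partial} + 1_\Delta,
\qquad
1_Q = 1_\Delta + 1_{Q_\partial} + 1_{Q_{\mathrm{sep}}},
% first split the left indicator, keeping 1_Q whole:
\langle 1_R b_2, T(b_1 1_Q)\rangle
  = \langle 1_{R_{\mathrm{sep}}} b_2, T(b_1 1_Q)\rangle
  + \langle 1_{R_\partial} b_2, T(b_1 1_Q)\rangle
  + \langle 1_\Delta b_2, T(b_1 1_Q)\rangle,
% then split 1_Q in the last term only:
\langle 1_\Delta b_2, T(b_1 1_Q)\rangle
  = \langle 1_\Delta b_2, T(b_1 1_\Delta)\rangle
  + \langle 1_\Delta b_2, T(b_1 1_{Q_\partial})\rangle
  + \langle 1_\Delta b_2, T(b_1 1_{Q_{\mathrm{sep}}})\rangle.
```

The asymmetric order of splitting appears deliberate: every resulting term pairs either two sets at positive distance from each other (first and fifth), the same rectangle $\Delta$ with itself (third), or involves a boundary piece (second and fourth).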

The second and the fourth terms on the right of (10.4) correspond to the bad
parts, and will be left alone for a while. The middle term satisfies

$$\langle 1_\Delta b_2,T(b_11_\Delta)\rangle=:T_\Delta\mu(\Delta),\qquad|T_\Delta|\lesssim 1,$$

as a direct application of the assumed rectangular weak boundedness property of
$M_{b_2}TM_{b_1}$, since $\Delta=Q\cap R$ is clearly a rectangle. Hence

|E^^(-9)(1a62,T(&i1a))cq(/)| = I jY,lRdR{g)T^CQ(J)lQAiJi 
= I j j ^£R^RdR{9)^eR,Ti:^CQ(R,){f)lQ(R,)df{e)dii 



- \^'^R^RdR{g)\ 



< 



< 



^SR^Rwii^Rw^g) 



^£QTACQ{f)lQ 



LP'( 



Q 

.m;X-)I 



^^Q'PQwi'fiQW, f) 

Q 



LP( 



\g\\Lp'{fi-X') \\f\\LP{fj.;X), 



where, in the second to last step, the contraction principle was used both to remove
the bounded factors $T_\Delta$ and to dominate the functions $1_Qc_Q(f)=1_Q\varphi_{Q^{(1)}}\langle\varphi_{Q^{(1)}},f\rangle$
by the right-hand side without the $1_Q$, and similarly on the $g$ side.

Now consider the first term on the right of (10.4); the fifth term is essentially
similar, the main point being that the two indicators in both terms correspond to
sets separated from each other. By



|(lfl.„p&2,T(6ilQ))| = 



< 



b2{x)K{x, y)bi{y) dfi{y) dfi{x) 



Write 



(lfl..,&2,T(6ilQ))=:TQ 



dist(i?scp,Q)'* ~ ^(g)'^ 



\Tq\ < 1. 



Then 



E^^(5)(ifl..p^2,n&iiQ)>cQ(/) 

i? 

=: ^{9,i^Rw)\\^Rw\\LHp)j^^yQa)\\LHp){VQn) J), 

R 

where also \Tq\ < 1. Reindexing the sum, so as to write simply Q and R instead 
of Q*^^^ and R'^^\ reduces the considerations to the series 



XI , V'i? ) 1 1 '/'fl 1 1 L 1 (p) 1 1 V'Q 1 1 L 1 (m) ('^Q ' /) ' 



(10.5) 



where $|t_Q|\lesssim 1$. By UMD,



LP{P<S,ii;X) 



< 



'^£q'^R{Q) 
Q 



(10.6) 



By symmetry, one may assume that $\ell(Q)\ge\ell(R)$, hence $\ell(Q^{(r)})\ge 2^r\ell(R)$. In
order to apply the tangent martingale trick, one checks that $R\subset Q^{(r)}$. Indeed, if
not, then

$$\ell(R)>\operatorname{dist}(R,Q)\ge\operatorname{dist}(R,Q^{(r)})=\operatorname{dist}(R,\partial Q^{(r)})\ge\ell(R)^\gamma\ell(Q^{(r)})^{1-\gamma}\ge 2^{r(1-\gamma)}\ell(R)>\ell(R),$$

a contradiction.

Hence, reindexing the summation in terms of $S=Q^{(r)}$,



Ks{x, y) T—^^y) ^^(y) 



Lp( dP(e) dii{x)-X) 



where 



$$K_S(x,y)=\sum_{Q:\,Q^{(r)}=S}1_{R(Q)}(x)\|\varphi_Q\|_{L^1(\mu)}\varphi_Q(y)b_1(y)$$

is supported in $S\times S$ and $\|K_S\|_\infty\lesssim 1$. As before, $b_1^{-1}D^{b_1}_{k-r}f$ is constant on all
$Q'\in\mathscr{D}_{k-r-1}$, so that, after splitting the $k$ summation into $r+1$ subseries according to
$k\equiv k_0 \mod r+1$, Corollary 6.3 applies to each of these. The conclusion is

$$\Big|\sum_Rd_R(g)\langle 1_{R_{\mathrm{sep}}}b_2,T(b_11_Q)\rangle c_Q(f)\Big|\lesssim\|g\|_{L^{p'}(\mu;X^*)}\|f\|_{L^p(\mu;X)},$$

and the same is true with $1_{R_{\mathrm{sep}}}$ and $1_Q$ replaced by $1_\Delta$ and $1_{Q_{\mathrm{sep}}}$, as argued above.

11. Bad boundary regions 

It is time to encounter the bad parts, which were avoided until now, in order to
complete the proof of Tb theorem 1. In this section, the implicit constants are still
allowed to depend on $r$, as before, but any dependence on the auxiliary parameter
$\eta$ from the previous section, which was used to define the depth of the boundary
regions $\delta_Q$ in (10.3), will be stated explicitly. In estimating the expansion

$$\sum_{Q\in\mathscr{D},\,R\in\mathscr{D}'}\langle g,\psi_R\rangle T_{RQ}\langle\varphi_Q,f\rangle, \tag{11.1}$$

the following inequalities, following the convention about the implicit constants just 
stated, have been obtained so far: 



J2 E (5,^fl)TflQ(^Q,/) <C\\g\\p,\ 

e{Q)<dist{QM)Ae{R) 



llpl 



llpl 



J2 E {9,i'R)TRQ{^Qj) <C\\g\\p. 

«e®good Qe^B^d.Qcfl (11.2) 
e{Q)<2~'-e{R) 



E 

dist(Q,R)<e{Q)/\e{R) 
2-''<i{Q)/e{R)<2'' 



where $T^{\mathrm{good}}_{RQ}$ is the part of the coefficient $T_{RQ}$ corresponding to the first, third
and fifth terms in (10.4) in the decomposition performed in the previous section,
and $\tilde T_{RQ}$ is the modified matrix entry with the paraproduct removed, as treated
in Section 8. In addition, it was shown in Section 9 that the correction terms
$T_{RQ}-\tilde T_{RQ}$ satisfy the similar bound on average, in the sense that

^PfJ E E {9,y^R){TQR-fnQ){^Q,f)\<C\\g\\p,\\f\\p. (11.3) 

eiQ)<2-''e{R) 

Note that, were it not for the labels "good" in various places in (11.2) and (11.3),
these subseries would cover the half of (11.1) with $\ell(Q)\le\ell(R)$, and in fact a bit
more in the case of close-by cubes. By symmetry, it hence remains to treat the
bad cubes, and also the bad parts of the matrix coefficients, $T^{\mathrm{bad}}_{RQ}=T_{RQ}-T^{\mathrm{good}}_{RQ}$,
corresponding to the second and fourth terms in (10.4), which were left out in the
last line of (11.2).

The treatment of these remaining terms will be similar to the estimate (11.3),
in that control is gained only after averaging over the dyadic systems. But there
is also the important difference that the bounds will now depend on the operator
norm $\|T\|_{\mathscr{L}(L^p(\mu;X))}$, and one needs to get a small factor in front of it in order to
eventually absorb it into the left side of the final inequality.

The estimation of the bad parts will be based on the fact that every UMD space 
has cotype s for some s G [2, oo), i.e., satisfies the inequality: 



l/s 



< 



and then on the following improvement of the contraction principle (corresponding
to $t=\infty$ below) under this extra condition. Note that the previous estimate (with
the usual modification) is always true for $s=\infty$ and never for $s<2$.
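
For orientation, the cotype condition can be written in one standard form as follows. This block states the textbook definition rather than reproducing the display above, and the exponent 2 on the randomized norm is a choice made here (by Kahane's inequality, any finite exponent gives an equivalent condition); $\xi_j$ and $\varepsilon_j$ are generic names for vectors of $X$ and independent random signs.

```latex
% Cotype s (one standard formulation; notation chosen for this sketch):
\Big(\sum_{j=1}^n\|\xi_j\|_X^s\Big)^{1/s}
  \le C\Big(\mathbb{E}\Big\|\sum_{j=1}^n\varepsilon_j\xi_j\Big\|_X^2\Big)^{1/2}.

% Why s < 2 is impossible: take \xi_1 = \dots = \xi_n = \xi \neq 0. Then
\Big(\sum_{j=1}^n\|\xi\|_X^s\Big)^{1/s} = n^{1/s}\|\xi\|_X,
\qquad
\Big(\mathbb{E}\Big|\sum_{j=1}^n\varepsilon_j\Big|^2\Big)^{1/2}\|\xi\|_X = n^{1/2}\|\xi\|_X,
% and n^{1/s} \le C n^{1/2} fails for large n when s < 2.
```

In the scalar case $X=\mathbb{C}$ the inequality holds with $s=2$ and with equality, since the random signs are orthonormal in $L^2$.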

Proposition 11.4. Let $\xi_j\in X$, where $X$ is a Banach space of cotype $s\in[2,\infty)$,
and let $\theta_j\in L^t(\tilde\Omega)$ for some $\sigma$-finite measure space $\tilde\Omega$ and $t\in(s,\infty)$. Then

oo 

3 = i 



Proof. By approximation, it suffices to consider finite sums $1\le j\le n$. This result
can be found in [TS], Lemma 3.1. □

Now turn to the bad analogue of the last series in (11.2), and more precisely to
the part of the series (10.2) with the second term from (10.4), $\langle 1_{R_\partial}b_2,T(b_11_Q)\rangle$, in
place of $\langle 1_Rb_2,T(b_11_Q)\rangle$.

Lemma 11.5. Let $X^*$ have cotype $s$ and take $t>s\vee p'$. Then



^ dR{g){lR,h2,T{b^\Q))CQ{f) 



^ V^^^\\T\\j^(LP{,j.;X))\\g\\LP'{tJ,;X')\\f\\LP(p;X) 

Proof. First randomize and use Hölder's inequality, with the result that

I dR{g){lR,b,,TihlQ))cQ{f)\ 



/ ( E esdsig)lsob2,T(^ ^ eRCQif)b,lQ 



good 

- II E ^sdsi9)lsab2 





ll^( 


LP'(I 





LP(r(g)fi:X) 



From the second factor, one may extract $\|T\|_{\mathscr{L}(L^p(\mu;X))}$, and then by the contraction
principle and unconditionality



E ^QCq(/)6i1q 



As for the first factor, write 





< 


Lp{r 


»kX) II 



Sik) 



Qesi 



fe+r 

u U^. 

j=k-r Qe@i 



LP 



X)

and then (dropping $b_2$ by the contraction principle)



Sa\ 



ses>' 



Lp' (V(g)^t-X*) 



feez 



-Re®: 



< 



^£fel5(fe)(x) ^ dfl,(g)lfl(a 
fcez -Re®; 



p7* 



Lp' (P®;i:X*) 



for t > p'. 

For each fixed $x\in\mathbb{R}^N$, the integrand is of the form considered in Proposition 11.4 with

where $R(x,k)$ is the unique $R\in\mathscr{D}'_k$ containing $x$. (There is now a different Lebesgue norm
on the probability space $(\Omega,\mathbb{P})$ than in the proposition, which is however irrelevant
thanks to Kahane's inequality.) The random variables $1_{\delta(k)}(x)$, functions of
the implicit variable $\beta\in(\{0,1\}^N)^{\mathbb{Z}}$, which governs the distribution of the random
dyadic system and hence of the boundary regions $\delta(k)$, obviously belong to
$L^t((\{0,1\}^N)^{\mathbb{Z}})$ for all $t\in[1,\infty]$, and satisfy

$$\|1_{\delta(k)}(x)\|_{L^t((\{0,1\}^N)^{\mathbb{Z}})}=\mathbb{P}_\beta(x\in\delta(k))^{1/t}\lesssim\eta^{1/t}.$$

With a choice of $t$ as in the assertion, Proposition 11.4 then implies that



Sa 



seS" 



-Re®' 



Lp' (P^li;X*) 



and this is dominated by $\eta^{1/t}\|g\|_{L^{p'}(\mu;X^*)}$ by similar contraction principle and unconditionality arguments as before. □

The case of the fourth term from (10.4) is analogous (the only break in the
symmetry being one more application of the contraction principle to estimate $1_\Delta$
by $1_R$ in the appropriate place), and I only state the result, leaving its verification
as an easy exercise along the lines of the previous proof.

Lemma 11.6. Let $X$ have cotype $s$ and take $t>s\vee p$. Then



E 



/3' 



J2 dRig){lAb2,T{b,lQa))cQif) 



Qe®g 



< V^^*\\T\\j^(LP{f,-X))\\g\\Lp'{p-X*)\\f\\LP{f,-X) 

The results of Lemmas 11.5 and 11.6 may be summarized as



-Re®' . 



E 



{g,^R)Thh'{vQj) 



, Qe®good 

dist{Q,R)<e{Q)/\e{R) 
2-'-<^(Q)/f(-R)<2'- 



(11.7) 



<^^^'/*ril^(L.(^;X))ll5llp'll/IU 

for any $t$ which is bigger than both $\max(p,p')$ and the cotypes of $X$ and $X^*$.

12. Synthesis

The proof of Tb theorem 1 will now be completed. This also involves choosing
appropriate values for the auxiliary parameters $r$ and $\eta$. Hence any dependence
on these numbers will now be indicated explicitly, and any constant $C$ may only
depend on the parameters as listed in Notation 1.5.

Given a function $f\in L^p(\mu;X)$, define its good and bad parts

/a:= »q/+ E ^q/, AG {good, bad}; 

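an analogous definition is made for $g\in L^{p'}(\mu;X^*)$. Observe that the decomposition
$f=f_{\mathrm{good}}+f_{\mathrm{bad}}$ depends on the random parameters $\beta$ (which determines the dyadic
system $\mathscr{D}$) as well as $\beta'$ and $\tilde\beta'$ (which determine the goodness or badness of a given
$Q\in\mathscr{D}$); likewise, the splitting $g=g_{\mathrm{good}}+g_{\mathrm{bad}}$ depends on $\beta'$, $\beta$ and $\tilde\beta$.

Having assumed that $T$ is a bounded operator, fix some compactly supported
$f\in L^p(\mu;X)$ and $g\in L^{p'}(\mu;X^*)$ so that

$$\|T\|_{\mathscr{L}(L^p(\mu;X))}\le 2|\langle g,Tf\rangle|,\qquad\|g\|_{p'}=\|f\|_p=1.$$

One first expands

$$\langle g,Tf\rangle=\langle g_{\mathrm{good}},Tf_{\mathrm{good}}\rangle+\langle g_{\mathrm{good}},Tf_{\mathrm{bad}}\rangle+\langle g_{\mathrm{bad}},Tf\rangle.$$

The left side is independent of the parameters $\beta,\beta',\tilde\beta,\tilde\beta'$, which govern the splitting
on the right. Now take the expectation $\mathbb{E}_{\beta\beta'\tilde\beta\tilde\beta'}$ of both sides of the previous equality.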

Collecting the estimates from the previous sections, summarized in (11.2), (11.3)
and (11.7), and writing out the dependence on $r$, which was suppressed in the
implicit constants until now, it has been established that

$$|\mathbb{E}_{\beta\beta'\tilde\beta\tilde\beta'}\langle g_{\mathrm{good}},Tf_{\mathrm{good}}\rangle|\le C(\eta,r)+C(r)\eta^{1/t}\|T\|_{\mathscr{L}(L^p(\mu;X))}.$$

Indeed, the mentioned estimates involved averaging over some of the $\beta$ parameters
only, being uniform with respect to the other ones, but such inequalities clearly
imply the weaker version, where all these parameters are averaged out. Altogether,
using also that $\|g_{\mathrm{good}}\|_{p'}\lesssim\|g\|_{p'}=1$, it follows that

$$\|T\|_{\mathscr{L}(L^p(\mu;X))}\le C(\eta,r)+C(r)\eta^{1/t}\|T\|_{\mathscr{L}(L^p(\mu;X))}+C\|T\|_{\mathscr{L}(L^p(\mu;X))}\big(\mathbb{E}_{\beta\beta'\tilde\beta\tilde\beta'}\|g_{\mathrm{bad}}\|_{p'}+\mathbb{E}_{\beta\beta'\tilde\beta\tilde\beta'}\|f_{\mathrm{bad}}\|_p\big). \tag{12.1}$$

It remains to estimate the expectations of the bad parts, which is very similar 
to the previous section, for instance the second one: 

E/3/^/||/bad||LP(A,;X) 

t ip/t , aVp 



< 



V^'l E^Qib?d'(Q)i^QVW||^ ]' M^)) 



By Lemma the random variables [(3' ,j3') l(^g^ (Q) satisfy 

l|lbad(Q)||L*(({o,i}^«)-) = P/3'/3'(Q is bad)i/* = e(r).

where $\epsilon(r)\to 0$ as $r\to\infty$. Hence, by Proposition 11.4 with $t>s\vee p$, where $X$
has cotype $s$, it follows that



< 



LP{P(g>ti-X) 



e{r)\\fh.i^,x). 



Using the similar estimate for $g_{\mathrm{bad}}$, and substituting back to (12.1), it follows
that

$$\|T\|_{\mathscr{L}(L^p(\mu;X))}\le C(r,\eta)+C(r)\eta^{1/t}\|T\|_{\mathscr{L}(L^p(\mu;X))}+C\epsilon(r)\|T\|_{\mathscr{L}(L^p(\mu;X))}.$$

Now one first fixes a large enough $r$ so that $C\epsilon(r)\le\frac14$. Then one picks a small
enough $\eta$ so that $C(r)\eta^{1/t}\le\frac14$. Thus

$$\|T\|_{\mathscr{L}(L^p(\mu;X))}\le C(r,\eta)+\big(\tfrac14+\tfrac14\big)\|T\|_{\mathscr{L}(L^p(\mu;X))},$$

and this completes the proof of Tb theorem 1.
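
The final step is an absorption argument, and it is the place where the a priori finiteness of the operator norm, assumed at the outset, is used. A generic sketch, in which $A$, $\theta_1$ and $\theta_2$ are placeholder names introduced here for the constant term and the two small coefficients:

```latex
% Assumptions of this sketch: 0 \le \|T\| < \infty (a priori bound),
% A \ge 0, and \theta_1 + \theta_2 < 1.
\|T\| \le A + (\theta_1+\theta_2)\|T\|
\;\Longrightarrow\;
(1-\theta_1-\theta_2)\|T\| \le A
\;\Longrightarrow\;
\|T\| \le \frac{A}{1-\theta_1-\theta_2}.
```

If $\|T\|$ were not known to be finite beforehand, the first implication would involve subtracting an infinite quantity from both sides and the conclusion would not follow.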



13. Operator-valued kernels 

This section explains the extension of Tb theorem 1 to the case of operator-valued
kernels $K(x,y)\in\mathscr{L}(X)$, as stated in Tb theorem 2. Following the "Rademacher
rule of thumb" for operator-kernels mentioned in the Introduction, define a $d$-dimensional Rademacher-Calderón-Zygmund kernel as a function $K(x,y)$ of variables $x,y\in\mathbb{R}^N$ with $x\neq y$ and taking values in $\mathscr{L}(X)$, which satisfies

$$\mathscr{R}\big(\{|x-y|^dK(x,y):x,y\in\mathbb{R}^N,\ x\neq y\}\big)\le 1, \tag{13.1}$$

$$\mathscr{R}\Big(\Big\{\frac{|x-y|^{d+\alpha}}{|x-x'|^\alpha}[K(x,y)-K(x',y)],\ \frac{|x-y|^{d+\alpha}}{|x-x'|^\alpha}[K(y,x)-K(y,x')]:\ x,x',y\in\mathbb{R}^N,\ |x-y|>2|x-x'|>0\Big\}\Big)\le 1 \tag{13.2}$$

for some $\alpha>0$. Recall that $\mathscr{R}(\mathscr{T})$ designates the Rademacher-bound of the set
$\mathscr{T}$ as defined after (1.8). As in the scalar case, multiplicative constants could be
allowed in these conditions, but will be suppressed.

Let $T:f\mapsto Tf$ be a linear operator acting on some functions $f:\mathbb{R}^N\to X$ or
$f:\mathbb{R}^N\to\mathbb{C}$, producing new functions $Tf:\mathbb{R}^N\to X$ in the former case and $Tf:\mathbb{R}^N\to\mathscr{L}(X)$ in the latter. If $\xi\in X$ and $F:\mathbb{R}^N\to\mathbb{C}$ or $F:\mathbb{R}^N\to\mathscr{L}(X)$, define
the function $F\otimes\xi:\mathbb{R}^N\to X$ by $(F\otimes\xi)(x):=F(x)\xi$, where the last expression
is the product of a scalar and a vector, or the action of an operator on a vector,
respectively. With this notation, suppose that $T(\varphi\otimes\xi)=(T\varphi)\otimes\xi$ for $\varphi:\mathbb{R}^N\to\mathbb{C}$
and $\xi\in X$. The adjoint $T^*$ is defined via the duality $\langle g,f\rangle=\int\langle g(x),f(x)\rangle\,d\mu(x)$
between functions $f:\mathbb{R}^N\to X$ and $g:\mathbb{R}^N\to X^*$: for $\varphi,\psi:\mathbb{R}^N\to\mathbb{C}$, $\xi\in X$ and
$\xi^*\in X^*$,

and hence $\langle T^*\psi,\varphi\rangle=\langle\psi,T\varphi\rangle^*\in\mathscr{L}(X^*)$ for scalar-valued functions $\varphi,\psi$.

Such a $T$ is called a Rademacher-Calderón-Zygmund operator with kernel $K$ if

$$Tf(x)=\int K(x,y)f(y)\,d\mu(y) \tag{13.3}$$

for $x$ outside the support of $f$. An operator $T$ is said to satisfy the rectangular weak
Rademacher-boundedness property if there holds



< 1. 



Recall that in Tb theorem 2 this assumption is made for $M_{b_2}TM_{b_1}$ in place of $T$,
where $b_1,b_2$ are two fixed weakly accretive functions. As in the scalar-kernel case,
the simplifying assumption is made that $T$ already defines an a priori bounded
operator on $L^p(\mu;X)$.

At this point, one can already explain the modifications in the proof of Tb
theorem 1, except the part involving the paraproduct, which are required to get the
operator-valued Tb theorem 2. It is very simple: one just repeats the same proof,
and the assumed Rademacher-boundedness conditions ensure that whenever one
"pulled out" bounded scalar coefficients from the randomized series (which persist
throughout the arguments), the same can be done with the operator coefficients
by the very definition (1.8). This is by now completely standard in the study of
operator-valued singular integrals (cf. [11] Uni HH]), and it would be redundant to
say anything more here.

One still has to make sense of the actual Tb conditions and comment on their
role in handling the paraproduct part of $T$ in the operator-kernel case. To this end,
observe first that formally

$$\langle T^*b_2,b_1\varphi_Q\rangle=\big(\langle b_2,T(b_1\varphi_Q)\rangle\big)^*=\big(\langle 1_{2Q}b_2,T(b_1\varphi_Q)\rangle\big)^*+\big(\langle 1_{(2Q)^c}b_2,T(b_1\varphi_Q)\rangle\big)^*\in\mathscr{L}(X^*).$$

The first term is the adjoint of the operator

$$\xi\in X\mapsto\langle 1_{2Q}b_2,T(b_1\varphi_Q)\rangle\xi=\langle 1_{2Q}b_2,T(b_1\varphi_Q\otimes\xi)\rangle\in X,$$

where the right side is well-defined, since $f:=b_1\varphi_Q\otimes\xi$, and then $Tf$, is in $L^p(\mu;X)$
and $1_{2Q}b_2\in L^{p'}(\mu)$. By (13.3) and the fact that $b_1\varphi_Q$ has a vanishing integral, the
second term involves the pairing

$$\int_{(2Q)^c}b_2(x)\int_QK(x,y)b_1(y)\varphi_Q(y)\,d\mu(y)\,d\mu(x)=\int_{(2Q)^c}\int_Qb_2(x)[K(x,y)-K(x,y_Q)]b_1(y)\varphi_Q(y)\,d\mu(y)\,d\mu(x),$$

where $y_Q$ is the centre of $Q$ and the $\mathscr{L}(X)$-valued double integral converges absolutely by (13.2) (even the uniform boundedness instead of Rademacher-boundedness
would suffice here).

In Tb theorem 2 it was assumed that

This condition can be interpreted as follows: There exists a function

$$h_2\in\mathrm{BMO}^{p'}_\lambda(\mu;Z)\subset\mathrm{BMO}^{p'}_\lambda(\mu;\mathscr{L}(X^*))$$

of norm at most 1, such that

$$\langle T^*b_2,b_1\varphi_Q\rangle=\langle h_2,b_1\varphi_Q\rangle=\int h_2(x)b_1(x)\varphi_Q(x)\,d\mu(x)$$

for all "Haar" functions $\varphi_Q$. The space $\mathrm{BMO}^p_\lambda(\mu;Z)$ is defined just like the scalar-valued version (see (1.4)), only using the norm of $Z$ in place of the absolute value.
Similarly one interprets the condition that $Tb_1=h_1\in\mathrm{BMO}^p_\lambda(\mu;Y)$.

Lemma 13.4. For $p\in(1,\infty)$ and $h\in\mathrm{BMO}^p_\lambda(\mu;Z)$, where $Z$ is a UMD space,
there holds



E 

Qgg,good.Q^j^ 



Proof. The proof is exactly the same as that of Lemma 9.3, which deals with $Z=\mathbb{C}$;
indeed, only the UMD property of $\mathbb{C}$ was employed there. □

Theorem 13.5. Under the hypotheses of Tb theorem 2,

$$\|\Pi_2\|_{\mathscr{L}(L^{p'}(\mu;X^*))}\lesssim\|T^*b_2\|_{\mathrm{BMO}^{p'}_\lambda(\mu;Z)}\lesssim 1.$$

Proof. This repeats the proof of Theorem 9.6. The initial considerations and the
application of the tangent martingale trick of Theorem 6.1 for $X^*$-valued functions
work in exactly the same way as before. In place of (9.9), one applies the abstract
paraproduct estimate of Theorem 3.4 with $X_1=X^*$ and $X_3=L^q(\nu;X^*)$ ($q:=p'$)
as before, but now with $X_2:=L^q(\nu;Z)\subset\mathscr{L}(X_1,X_3)$, recalling that $Z\subset\mathscr{L}(X^*)$.
And the right side of (9.9) is then estimated as before, with Lemma 13.4 in place of
Lemma 9.3. Finally, note that the last inequality of the assertion is here assumed
as stated, contrary to the scalar-kernel Theorem 9.6, where it was deduced from
$\|T^*b_2\|_{\mathrm{BMO}^1_\lambda(\mu)}\le 1$ and the other assumptions by the results of Nazarov, Treil and
Volberg [25]. □

Thus all the arguments of Tb theorem 1 carry over to the setting of Tb theorem 2,
and the sketch of the proof of the last-mentioned result is complete.